Description

Technical Field

[0001] The present invention relates to novel proteins having a cytidine deaminase activity; DNAs and fragments
thereof (cDNAs, genomic DNAs, and primer DNAs) encoding the proteins; expression vectors comprising the DNAs;
transformants transformed with the expression vectors; antibodies reactive to the proteins or fragments thereof; cells
producing the antibodies; and methods for identifying substances that regulate production of the proteins, transcription
of genes encoding the proteins into mRNAs, or enzyme activities of the proteins.


Background Art 

[0002] The germinal center of mammals comprises a highly specialized microenvironment required for the final
process of maturation towards antigen-specific memory cells and long-lived plasma cells (Embo. J., Vol.16, No.11,
p.2996-3006, 199 ; Semin. Immunol., Vol.4, No.1, p.11-17, 1992). Two fundamental types of editing of the
immunoglobulin genes are known to take place in this microenvironment (J. Exp. Med., Vol.173, No.5, p.1165-1175, 1991;
Embo. J., Vol.12, No.13, p.4955-49B7, 1993; Adv. Exp. Med. Biol., Vol.186, p.145-151, 1985; Nature, Vol.342, No.6252,
p.929-931, 1989; Cell, Vol.67, No.6, p.1121-1129).

[0003] One is somatic hypermutation (Curr. Opin. Immunol., Vol.7, No.2, p.248-254, 1995; Annu. Rev. Immunol.,
Vol.14, p.441-457, 1996; Science, Vol.244, No.4909, p.1152-1157, 1989), a phenomenon in which extensive point
mutation occurs in the exons encoding the variable regions of immunoglobulin. Accumulation of point mutations leads
to selection of B cells expressing high-affinity immunoglobulins on their cell surface, accompanied by affinity
maturation of antibodies (Embo. J., Vol.4, No.2, p.345-350, 1985; Proc. Natl. Acad. Sci. USA, Vol.85, No^l , p.8206-8210,
1988). As a result, the immunoglobulin genes are edited into new functional genes.

[0004] The other is class switch recombination (CSR). In this recombination, effector functions of antibodies, such
as complement fixation, are selected by exchanging the exons encoding the constant region of the immunoglobulin heavy chain
(Curr. Top. Microbiol. Immunol., Vol.217, p.151-169, 1996; Annu. Rev. Immunol., Vol.8, p.717-735, 1990).
[0005] These two types of genetic editing are very important for an effective humoral immune response that eliminates
harmful microbes. The molecular mechanisms of these genetic phenomena have not yet been elucidated despite extensive
studies over several decades.

[0006] The present inventors isolated a mouse B cell clone, CH12F3-2, as a research tool to elucidate the molecular
mechanism of class switch recombination of immunoglobulin. In this B cell line, class switch recombination (CSR) from
IgM to IgA begins several hours after stimulation with IL-4, TGF-β, and CD40L, and finally over 80% of the cells become
IgA positive (Immunity, Vol.9, p.1-10, 1998; Curr. Biol., Vol.8, No.4, p.227-230, 1998; Int. Immunol., Vol.8, No.2,
p.193-201, 1996).

[0007] Using the mouse B cell clone CH12F3-2, the present inventors reported that the breakpoints of CSR
are distributed not only in the switch region (S region), which consists of characteristic repeated sequences, but also
in the neighboring sequences (Curr. Biol., Vol.8, No.4, p.227-230, 1998). However, breakpoints were rarely seen in the
I exon and the C exon, which are located upstream and downstream of the S region, respectively. In addition,
accumulated scientific evidence has shown that transcription of the I exon and the C exon and splicing of the
transcripts are essential for CSR (Cell, Vol.73, No.6, p.1155-1264, 1993; Science, Vol.259, No.5097, p.984-987, 1993;
Proc. Natl. Acad. Sci. USA, Vol.90, No.8, p.3705-3709, 1993; Cell, Vol.81, No.6, p.833-836, 1995).

[0008] This suggests that the transcripts are involved in CSR either directly or indirectly. Accordingly, the
present inventors propose a theory that class switching is initiated by recognition of a DNA-RNA complex structure and
not by recognition of the nucleotide sequences of the switch region. This idea is further supported by the fact that,
even when the Sα region is replaced with the Sε region or the Sγ region by introducing a mini-chromosome into the
above-mentioned mouse B cell clone CH12F3-2, CSR in the mini-chromosome occurs efficiently upon cytokine stimulation
(Immunity, Vol.9, p.1-10, 1998).

[0009] In plants and protozoa, RNA editing, another type of genetic editing, is widely used as a means of producing
functional genes from a limited genome (Cell, Vol.81, No.6, p.833-836, 1995; Cell, Vol.81, No.6, p.837-840, 1995).
Editing of many RNAs, such as the mRNAs of apolipoprotein B (apoB), AMPA receptors, Wilms tumor-1, α-galactosidase,
and neurofibromatosis type-1, and tRNA-Asp, has been reported (Trends Genet., Vol.12, No.10, p.418-424, 1996;
Curr. Opin. Genet. Dev., Vol.6, No.2, p.221-231, 1996). Although the molecular mechanism of mammalian RNA editing
has not yet been elucidated, the editing performed by APOBEC-1 (apolipoprotein B mRNA editing enzyme, catalytic
polypeptide-1) is gradually becoming understood (Science, Vol.260, No.5115, p.1816-1819, 1993; J. Biol. Chem.,
Vol.268, No.28, p.20709-20712, 1993).

[0010] In apoB RNA editing, the first base C (cytosine) of the codon CAA, which encodes glutamine, is converted to U
(uridine), which alters the codon to UAA. As a result, an in-frame stop codon is created in the apoB mRNA (J. Cell.,
Vol.81, No.2, p.187-195, 1995; J. Cell., Vol.50, No.6, p.831-840, 1987; Science, Vol.238, No.4825, p.363-266, 1987).
apoB-48 and apoB-100 are the translation products of the edited and unedited apoB mRNA, respectively, and these
proteins possess totally different physiological functions from each other (J. Biol. Chem., Vol.271, No.5, p.2353-2356, 1996).
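
The codon change described in paragraph [0010] can be illustrated with a minimal sketch. The toy reading frame below is hypothetical and is not the apoB sequence; the function names are likewise illustrative assumptions. The sketch only shows how deaminating the first C of CAA to U yields the stop codon UAA in frame.

```python
# Illustrative sketch only: models C-to-U editing of a single base in an mRNA string.
STOP_CODONS = {"UAA", "UAG", "UGA"}


def edit_c_to_u(mrna: str, position: int) -> str:
    """Return mrna with the cytidine at `position` (0-based) converted to uridine."""
    if mrna[position] != "C":
        raise ValueError("only a cytidine can be deaminated to uridine")
    return mrna[:position] + "U" + mrna[position + 1:]


def first_stop_codon_index(mrna: str) -> int:
    """Return the 0-based codon index of the first in-frame stop codon, or -1 if none."""
    for i in range(0, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOP_CODONS:
            return i // 3
    return -1


# Hypothetical toy reading frame (NOT the real apoB mRNA): AUG, CAA, GGU
unedited = "AUGCAAGGU"
# Edit the first base of the CAA codon (0-based position 3).
edited = edit_c_to_u(unedited, 3)
```

In this toy frame the unedited string contains no stop codon, while the edited string carries UAA as its second codon, which mirrors how editing produces a truncated translation product.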
[0011] In site-specific RNA editing, auxiliary factors are required (Science, Vol.260, No.5115, p.1816-1819, 1993;
J. Biol. Chem., Vol.268, No.28, p.20709-20712, 1993). In the absence of auxiliary factors, APOBEC-1 shows only a
cytidine deaminase activity, with a non-specific, low affinity for RNA (J. Biol. Chem., Vol.268, No.28, p.20709-20712,
1993; J. Cell., Vol.81, No.2, p.187-195, 1995; J. Biol. Chem., Vol.270, No.24, p.14768-14775, 1995; J. Biol. Chem.,
Vol.270, No.24, p.14762-14767, 1995). The expression and activity of the auxiliary factors are found not only in organs
with apoB mRNA editing, but also in organs with an undetectable level of APOBEC-1 expression, or in organs without apoB
mRNA editing (Science, Vol.260, No.5115, p.1816-1819, 1993; J. Biol. Chem., Vol.268, No.28, p.20709-20712, 1993;
Nucleic Acids Res., Vol.22, No.10, p.1874-1879, 1994; Proc. Natl. Acad. Sci. USA, Vol.91, No.18, p.8522-8526, 1994;
J. Biol. Chem., Vol.269, No.34, p.21725-21734, 1994).

[0012] The unexpected expression of the auxiliary factors involved in apoB mRNA editing suggests that the auxiliary
factors may be involved in a more general cellular function or in other, as yet unknown, RNA editing. Since
CSR and hypermutation, both of which are genetic editing of immunoglobulin, may be accomplished by RNA
editing, it is of great interest to determine whether RNA editing takes place in the genetic editing of the
immunoglobulin genes mentioned above.

Disclosure of the Invention 


[0013] The present invention provides AID (Activation-Induced cytidine Deaminase), a novel cytidine deaminase
that is structurally related to APOBEC-1, one of the RNA editing enzymes, and that is involved in RNA editing in germinal
center B cells, where genetic editing of the immunoglobulin genes occurs, and a DNA encoding the enzyme.
[0014] The present inventors intensively searched for novel genes involved in class switch recombination (CSR),
one of the major types of genetic editing of the immunoglobulin genes. The inventors prepared cDNA libraries from the
mouse B cell clone CH12F3-2, in which class switch recombination from IgM to IgA occurs at an extremely high rate
upon activation of the cells by cytokine stimulation, with and without cytokine stimulation, and performed
subtraction cloning using the libraries. As a result, the present inventors found genes encoding novel mouse- and
human-derived proteins named AID (Activation-Induced cytidine Deaminase), which are structurally related to APOBEC-1,
one of the RNA editing enzymes, and have a cytidine deaminase activity similar to that of APOBEC-1.

[0015] The AID protein of the present invention possesses the features described below, and is considered to be a very
important RNA-modifying deaminase involved in regulating B cell activation, CSR of the immunoglobulin genes, somatic
hypermutation, and affinity maturation, all of which are genetic editing specific to germinal center function:

(1) The ORF of the cDNA encoding the AID protein encodes 198 amino acids, with a calculated molecular weight of 24 kDa
(mouse: SEQ ID NO: 2, and human: SEQ ID NO: 8). The mouse AID protein shows a molecular weight of approximately
28 kDa by SDS-PAGE.

(2) The amino acid sequence of the AID protein is 34% and 26% identical to that of APOBEC-1 (apolipoprotein B mRNA
editing enzyme, catalytic polypeptide-1) for the mouse- and human-derived proteins, respectively.

(3) The AID protein has a cytidine/deoxycytidine deaminase motif, which is the active center of the deaminase activity
conserved in the amino acid sequences of proteins belonging to the cytosine nucleoside/nucleotide deaminase family.

(4) The cytidine deaminase motif of the AID protein is allied with the RNA editing deaminase subgroup.

(5) The AID protein has a leucine-rich region considered to be important in protein-protein interaction, similar to
APOBEC-1. Four leucines in the leucine-rich region of the AID protein are conserved in the leucine-rich region of
APOBEC-1 in rabbit, rat, mouse, and human.

(6) In the primary structure of the AID protein, all amino acid residues reported to be necessary for APOBEC-1 to bind
RNA (Phe66, Phe87, His61, Glu63, and Cys93) are conserved.

(7) The AID protein has a pseudoactive site domain at its C terminus for forming a homodimer, similar to APOBEC-1 and
ECDDA, an E. coli-derived cytidine deaminase. The AID protein may form a homodimer or associate with other
auxiliary proteins.

(8) The AID protein shows a concentration-dependent cytidine deaminase activity. The activity is inhibited
dose-dependently by tetrahydrouridine (THU), a specific inhibitor of cytidine deaminase. Also, a zinc chelator,
1,10-o-phenanthroline, inhibits the cytidine deaminase activity of the AID protein, while 1,7-o-phenanthroline, the
inactive isomer, shows only weak inhibition. Thus, the AID protein can be considered to be a zinc-dependent cytidine
deaminase, like APOBEC-1.

(9) Strong expression of AID mRNA is seen in lymph nodes (mesenteric and amygdaline). Weak expression is also
seen in spleen.

(10) mRNA expression of AID is seen in a variety of lymphoid tissues (Peyer's patch, mesenteric lymph
node, axillary lymph node, spleen, and bone marrow). Notable expression is seen in particular in peripheral
lymphoid organs, such as lymph nodes and Peyer's patch. In contrast, expression in primary lymphoid organs
is lower than in the peripheral lymphoid organs.

(11) In the mouse B cell clone CH12F3-2, in which cytokines (IL-4, CD40L, and TGF-β) stimulate class switching from
IgM to IgA, expression of AID mRNA is at the detection limit without cytokine stimulation, whereas with cytokine
stimulation expression is induced 3 hours after stimulation and reaches its maximum after 12 hours.

(12) AID mRNA expression in the mouse B cell clone CH12F3-2 is induced more strongly by simultaneous stimulation with
the three cytokines, IL-4, CD40L, and TGF-β, than by stimulation with any one of them. Also, de novo protein
synthesis can be considered necessary for augmentation of AID mRNA expression, because the induction of AID
mRNA expression by cytokines in the mouse B cell clone CH12F3-2 is inhibited by cycloheximide, a
protein synthesis inhibitor.

(13) In the in vitro test, augmentation of AID mRNA expression is seen when normal mouse spleen B cells
are stimulated with LPS alone, LPS+IL-4, or LPS+TGF-β.

(14) In the in vivo test, when normal mice are immunized with sheep red blood cells (SRBC), which are known to induce
clonal expansion, germinal center formation, and class switch recombination and affinity maturation of the
immunoglobulin genes, a significant augmentation of AID mRNA expression is seen 5 days after immunization.

(15) The in vivo augmentation of AID mRNA expression by SRBC immunization is seen specifically in splenic
CD19-positive B cells.

(16) AID mRNA expression in lymphoid organs is seen specifically in the germinal center, which is enriched with B cells
activated by antigen stimulation.

(17) The human AID gene is located at locus 12p13, close to locus 12p13.1, where the APOBEC-1 gene is located.


[0016] According to the characteristics described above, the AID protein of the present invention can be considered
to have a function of regulating various biological mechanisms required for generation of antigen-specific
immunoglobulins (specific antibodies), which eliminate non-self antigens (foreign antigens, self-reacting cells, etc.)
that trigger various diseases. The mechanisms for generating immunoglobulins having high specificity for antigens
include germinal center functions such as activation of B cells, class switch recombination of the immunoglobulin
genes, somatic hypermutation, and affinity maturation. The AID protein of the present invention can be considered to
be one of the enzymes that play an important role in the genetic editing occurring in germinal center B cells
(e.g. class switch recombination and somatic mutation).

[0017] Dysfunction of the AID protein of the present invention can cause humoral immunodeficiency,
since it induces failure of germinal center B cell functions, such as antigen-specific B cell activation, class switch
recombination, and somatic mutation. Conversely, hyperfunction of the AID protein may induce allergic disease or
autoimmune disease, since it can cause inappropriate B cell activation and unnecessary class switch recombination
and somatic mutation.

[0018] Therefore, regulation of the function of the AID protein and the gene encoding it enables prevention and
treatment of various immunodeficiencies, autoimmune diseases, and allergies that result from, for example, B cell
dysfunctions (e.g. IgA deficiency, IgA nephropathy, γ globulinemia, hyper IgM syndrome, etc.) or class switch
deficiency of immunoglobulin. Thus, the AID protein and the gene encoding the AID protein can be targets for the
development of drugs for therapy of the diseases mentioned above.

[0019] Examples of diseases for which prevention of onset, remission of symptoms, therapy, and/or symptomatic treatment
is expected from regulating the function of the AID protein of the present invention or the gene encoding it include,
for example, primary immunodeficiency syndromes with congenital disorders of the immune system, mainly various
immunodeficiencies considered to develop through B cell deficiency, decrease, or dysfunction (e.g., sex-linked
agammaglobulinemia, sex-linked agammaglobulinemia with growth hormone deficiency, immunoglobulin deficiency with
high IgM level, selective IgM deficiency, selective IgE deficiency, immunoglobulin heavy chain gene deletion,
κ chain deficiency, IgA deficiency, IgG subclass selective deficiency, CVID (common variable immunodeficiency),
infantile transient dysgammaglobulinemia, Rosen syndrome, severe combined immunodeficiency (sex-linked, autosomal
recessive), ADA (adenosine deaminase) deficiency, PNP (purine nucleoside phosphorylase) deficiency, MHC class II
deficiency, reticular dysplasia, Wiskott-Aldrich syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal
aberration, familial Ig hypermetabolism, hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, Good syndrome,
osteodystrophy, transcobalamin syndrome, secretory bead syndrome, etc.), various diseases with antibody production
deficiency that are secondary immunodeficiency syndromes with disorders of the immune system caused by an acquired
etiology (for example, AIDS, etc.), and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis,
conjunctivitis, allergic rhinitis, allergic enteritis, drug-induced allergy, food allergy, allergic urticaria,
glomerulonephritis, etc.).
[0020] That is, the AID protein of the present invention, a fragment thereof, a DNA encoding the AID protein, a
fragment thereof, and an antibody against the AID protein are useful as reagents for developing drugs for prevention
and therapy of such diseases.

[0021] Also, the DNA itself is useful as an antisense drug regulating the function of the AID gene at the gene level and
for use in gene therapy. The protein or a fragment thereof (e.g. the enzyme active site) is itself useful as a drug.

[0022] Furthermore, a DNA comprising a nucleotide sequence complementary to an arbitrary partial nucleotide
sequence in the nucleotide sequence of the genomic DNA encoding the AID protein of the present invention (especially
the human AID protein) is useful as a primer DNA for the polymerase chain reaction (PCR).
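
As a hedged illustration of what "a complementary nucleotide sequence to an arbitrary partial nucleotide sequence" means in practice, the sketch below derives the complementary strand of a chosen region, written 5' to 3'. The template string is a hypothetical toy sequence, not any SEQ ID NO of this document, and the function names are assumptions made for illustration.

```python
# Illustrative sketch: derive a primer complementary to a chosen region of a template.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_for_region(template: str, start: int, length: int) -> str:
    """Return a primer complementary to template[start:start + length] (0-based start)."""
    region = template[start:start + length]
    if len(region) != length:
        raise ValueError("requested region runs past the end of the template")
    return reverse_complement(region)


# Hypothetical toy template (NOT a sequence from this document).
toy_template = "ATGGACAGCCTC"
toy_primer = primer_for_region(toy_template, 0, 6)
```

A real primer design would also weigh length, melting temperature, and specificity, none of which this sketch attempts.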

[0023] An arbitrary partial nucleotide sequence of the genomic DNA encoding the AID protein (especially the human AID
protein) of the present invention can be amplified by PCR using a pair of such primer DNAs. For example, in the case that
a mutation or deletion in the nucleotide sequence of the genomic DNA (especially an exon) encoding the AID protein is
presumed to cause a certain immunodeficiency or allergy, mutations and deletions in the genomic DNA can be identified
by amplifying, by PCR using a pair of the primer DNAs, an arbitrary partial nucleotide sequence of the genomic DNA
encoding the AID protein obtained from tissue or cells of immunodeficiency or allergy patients; by analyzing the
presence and size of the PCR products and the nucleotide sequence of the PCR products; and by comparing that
nucleotide sequence with the corresponding nucleotide sequence in the genomic DNA encoding the AID protein derived
from a normal human. That is to say, this method is capable not only of, for example, elucidating relationships
between immunodeficiency or allergy and the AID protein, but also, in the case where the AID protein is the cause of
onset of a disease (e.g. immunodeficiency and/or allergy), of diagnosing the disease.
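
The comparison step of paragraph [0023], checking the size of an amplified product and then its sequence against the normal sequence, can be sketched as follows. This is a minimal sketch under stated assumptions: both sequences are hypothetical toy strings, and base-by-base comparison is only attempted when the lengths match, because locating an insertion or deletion would require an alignment that is outside the scope of this example.

```python
# Illustrative sketch: compare an amplified sequence with a reference ("normal") sequence.
from typing import List, Tuple


def compare_to_reference(reference: str, sample: str) -> Tuple[int, List[Tuple[int, str, str]]]:
    """Return (size_difference, substitutions).

    size_difference is len(sample) - len(reference); a non-zero value hints at an
    insertion or deletion. substitutions lists (1-based position, reference base,
    sample base) and is only filled in when both sequences have the same length.
    """
    size_difference = len(sample) - len(reference)
    substitutions: List[Tuple[int, str, str]] = []
    if size_difference == 0:
        for pos, (ref_base, obs_base) in enumerate(zip(reference, sample), start=1):
            if ref_base != obs_base:
                substitutions.append((pos, ref_base, obs_base))
    return size_difference, substitutions


# Hypothetical toy sequences (NOT taken from any SEQ ID NO in this document).
normal = "ATGGACAGC"
point_mutant = "ATGGATAGC"   # single base change at position 6
deletion_mutant = "ATGAGC"   # three bases shorter than the reference
```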
[0024] Furthermore, an antibody reactive to the AID protein of the present invention or a fragment thereof is extremely
useful as an antibody drug that regulates functions of the AID protein.

[0025] Furthermore, the gene (DNA), protein, and antibody of the present invention are useful as reagents for
searching for substrates (e.g. RNA, etc.) that interact with (bind to) the protein (enzyme) of the present invention,
or for other auxiliary proteins associated with the protein of the present invention, and for developing drugs
targeting the substrates and auxiliary proteins.

[0026] Also, model animals can be generated by disrupting (inactivating) the AID gene based on the genetic
information on the AID protein derived from mammals (e.g. mouse, etc.), which is one embodiment of the DNA of the
present invention. By analyzing the physical, biological, pathological, and genetic features of the model animal, it is
possible to elucidate functions of the genes and the proteins of the present invention.

[0027] Furthermore, by introducing a normal human AID gene or a mutant human AID gene (e.g. mutant human AID
genes derived from immunodeficiency patients), which is one embodiment of the present invention, into the model
animal whose endogenous gene has been disrupted, model animals having only normal or mutant human AID genes
of the present invention can be generated. By administering drugs (compounds, antibodies, etc.) targeting the
introduced human AID genes to the model animals, therapeutic effects of the drugs can be evaluated.

[0028] Furthermore, a method for identifying a substance that regulates production of the AID protein of the present
invention or transcription of a gene encoding the AID protein into mRNA, or a substance that inhibits the enzyme activity
of the AID protein (e.g. cytidine deaminase activity), is extremely useful as a means to develop drugs for therapy and
prevention of various diseases (especially immunodeficiency and/or allergy) in which the above-mentioned AID protein
or AID gene is considered to be involved.

[0029] Thus, the present invention, for the first time, provides the below-mentioned DNAs (cDNAs, genomic DNAs, and
arbitrary fragments thereof), proteins, expression vectors, transformants, antibodies, pharmaceutical compositions,
cells, the use of the DNA fragments as primer DNAs, and methods for screening.

(1) A DNA or a fragment thereof encoding a protein comprising the amino acid sequence of SEQ ID NO: 2 or 8.

(2) The DNA or the fragment of (1), wherein the protein has a cytidine deaminase activity.

(3) A DNA or a fragment thereof comprising the nucleotide sequence of SEQ ID NO: 1 or 7.

(4) A DNA or a fragment thereof comprising a nucleotide sequence of (a) or (b) below:

(a) a nucleotide sequence comprising the nucleotide residues 93 to 689 of SEQ ID NO: 1 or

(b) a nucleotide sequence comprising the nucleotide residues 80 to 676 of SEQ ID NO: 7.

(5) A DNA or a fragment thereof of (a) or (b) below:

(a) a DNA or a fragment thereof that hybridizes under stringent conditions with a DNA comprising the nucleotide
sequence of SEQ ID NO: 1 and that encodes a mammal-derived protein being homologous to a protein that
comprises the amino acid sequence of SEQ ID NO: 2 and having a cytidine deaminase activity or

(b) a DNA or a fragment thereof that hybridizes under stringent conditions with a DNA comprising the nucleotide
sequence of SEQ ID NO: 7 and that encodes a mammal-derived protein being homologous to a protein that
comprises the amino acid sequence of SEQ ID NO: 8 and having a cytidine deaminase activity.

(6) A protein or a fragment thereof comprising the amino acid sequence of SEQ ID NO: 2 or 8.

(7) A protein or a fragment thereof comprising substantially the same amino acid sequence as that of SEQ ID NO:
2 or 8 and having a cytidine deaminase activity.

(8) A protein of (a) or (b) below:

(a) a mammal-derived protein that comprises an amino acid sequence encoded by a DNA hybridizing under
stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID NO: 1, that is homologous to
a protein comprising the amino acid sequence of SEQ ID NO: 2, and that has a cytidine deaminase activity or

(b) a mammal-derived protein that comprises an amino acid sequence encoded by a DNA hybridizing under
stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID NO: 7, that is homologous to
a protein comprising the amino acid sequence of SEQ ID NO: 8, and that has a cytidine deaminase activity.

(9) An expression vector comprising the DNA or the fragment of any one of (1) to (5).

(10) A transformant transformed with the expression vector of (9).

(11) An antibody or a portion thereof reactive to the protein of any one of (6) to (8) or to a fragment of the protein.

(12) The antibody or the portion of (11), wherein the antibody is a monoclonal antibody.

(13) A pharmaceutical composition comprising the antibody or the portion of (11) or (12), and a pharmaceutically
acceptable carrier.

(14) A cell producing a monoclonal antibody reactive to the protein of any one of (6) to (8) or to a fragment of the
protein.

(15) The cell of (14), wherein the cell is a hybridoma obtained by fusing, with a mammal-derived myeloma cell, a
non-human mammal-derived B cell that produces a monoclonal antibody.

(16) The cell of (15), wherein the cell is a transgenic cell transformed by introducing, into a cell, either or both of
a DNA encoding a heavy chain of the monoclonal antibody and a DNA encoding a light chain of the monoclonal
antibody.

(17) A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one of (a) to (c) below:

(a) SEQ ID NO: 9,

(b) SEQ ID NO: 10, or

(c) SEQ ID NO: 35.

(18) A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one of (a) to (e) below:

(a) SEQ ID NO: 11,

(b) SEQ ID NO: 12,

(c) SEQ ID NO: 13,

(d) SEQ ID NO: 14, or

(e) SEQ ID NO: 15.

(19) A DNA comprising a complementary nucleotide sequence to an arbitrary partial nucleotide sequence of a
nucleotide sequence of any one of (a) to (h) below:

(a) SEQ ID NO: 9,

(b) SEQ ID NO: 10,

(c) SEQ ID NO: 11,

(d) SEQ ID NO: 12,

(e) SEQ ID NO: 13,

(f) SEQ ID NO: 14,

(g) SEQ ID NO: 15, or

(h) SEQ ID NO: 25.

(20) The DNA of (19), wherein the DNA comprises a nucleotide sequence of any one of (a) to (q) below:

(a) SEQ ID NO: 18,

(b) SEQ ID NO: 19,

(c) SEQ ID NO: 20,

(d) SEQ ID NO: 21,

(e) SEQ ID NO: 22,

(f) SEQ ID NO: 23,

(g) SEQ ID NO: 24,

(h) SEQ ID NO: 25,

(i) SEQ ID NO: 26,

(j) SEQ ID NO: 27,

(k) SEQ ID NO: 28,

(l) SEQ ID NO: 29,

(m) SEQ ID NO: 30,

(n) SEQ ID NO: 31,

(o) SEQ ID NO: 32,

(p) SEQ ID NO: 33, or

(q) SEQ ID NO: 34.

(21) Use of the DNA of (19) or (20) as a primer DNA in polymerase chain reaction. 

(22) Use of a pair of DNA of any one of (a) to (n) below as primer DNAs in polymerase chain reaction: 

(a) a DNA comprising the nucleotide sequence of SEQ ID NO: 31 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 32,

(b) a DNA comprising the nucleotide sequence of SEQ ID NO: 20 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 22,

(c) a DNA comprising the nucleotide sequence of SEQ ID NO: 21 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 30,

(d) a DNA comprising the nucleotide sequence of SEQ ID NO: 24 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 25,

(e) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 27,

(f) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 28,

(g) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29,

(h) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 27,

(i) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 28,

(j) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29,

(k) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 28,

(l) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29,

(m) a DNA comprising the nucleotide sequence of SEQ ID NO: 33 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29, or

(n) a DNA comprising the nucleotide sequence of SEQ ID NO: 18 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 19.

(23) A method for identifying a substance that regulates transcription of a gene encoding an AID protein comprising
the amino acid sequence of SEQ ID NO: 2 or 8 into mRNA, or production of the AID protein, the method comprising
the steps of:

(a) culturing, separately in the presence and the absence of the substance, cells producing the AID protein and

(b) (i) comparing the level of the AID protein produced by the cells cultured in the presence of the substance
with the level of the AID protein produced by the cells cultured in the absence of the substance or

(ii) comparing the level of the AID protein-encoding mRNA transcribed in the cells cultured in the presence
of the substance with the level of the AID protein-encoding mRNA transcribed in the cells cultured in the
absence of the substance.
(24) A method for identifying a substance that regulates transcription of a gene encoding an AID protein comprising
the amino acid sequence of SEQ ID NO: 2 or 8 into mRNA, or production of the AID protein, the method comprising
the steps of:

(a) culturing, separately in the presence and the absence of the substance, cells producing the AID protein
and a protein other than the AID protein, wherein transcription of a gene encoding the other protein into mRNA
is dependent in the cells on the degree of a signal of transcription of the gene encoding the AID protein into
mRNA and

(b) comparing the level of the other protein produced by the cells cultured in the presence of the substance
with the level of the other protein produced by the cells cultured in the absence of the substance.

(25) The method of (23) or (24), wherein the cells are transgenic cells transformed with a gene encoding the protein. 

(26) The method of (24), wherein the cells are transgenic cells transformed with a gene encoding the protein and 
a gene encoding the other protein. 

(27) The method of (26), wherein the protein is a reporter protein.

(28) The method of (27), wherein comparison of the level of the other protein is comparison of the level of a signal
generated by the reporter protein.

(29) The method of (27) or (28), wherein the reporter protein is luciferase.

(30) A method for identifying a substance that inhibits an enzyme activity of an AID protein comprising the amino
acid sequence of SEQ ID NO: 2 or 8, the method comprising the step of (a) or (b) below:

(a) culturing, separately in the presence and the absence of the substance, mammal-derived B cells or tissues
comprising the B cells, and comparing enzyme activities of the AID protein in the B cells separately cultured or

(b) (i) administering the substance separately to an AID gene knockout mouse whose endogenous AID gene
is inactivated so that transcription of the endogenous AID gene into mRNA is inhibited, and to a normal mouse
and

(ii) comparing enzyme activities of the AID proteins in the B cells isolated from the respective mice.

(31) The method of (30), wherein the enzyme activity is a cytidine deaminase activity. 
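
The comparison step shared by methods (23), (24), and (30), measuring a level in the presence and in the absence of a test substance, can be sketched as a simple ratio test. The document does not specify any decision threshold, so the two-fold cutoff, the function name, and the example readings below are all assumptions made purely for illustration.

```python
# Illustrative sketch: classify a test substance by comparing a measured level
# (e.g. protein amount, mRNA amount, reporter signal, or enzyme activity)
# with and without the substance.


def classify_substance(level_with: float, level_without: float, threshold: float = 2.0) -> str:
    """Return 'increases', 'decreases', or 'no clear effect'.

    threshold is an ASSUMED fold-change cutoff; the source text gives no value.
    """
    if level_without <= 0 or level_with < 0:
        raise ValueError("levels must be non-negative and the control level must be positive")
    if threshold <= 1:
        raise ValueError("threshold must be greater than 1")
    ratio = level_with / level_without
    if ratio >= threshold:
        return "increases"
    if ratio <= 1 / threshold:
        return "decreases"
    return "no clear effect"


# Hypothetical readings in arbitrary units (not experimental data).
example_inhibitor = classify_substance(50.0, 200.0)
example_inducer = classify_substance(400.0, 200.0)
example_neutral = classify_substance(210.0, 200.0)
```

In practice replicate measurements and a statistical test would replace the fixed cutoff; the sketch only captures the presence-versus-absence comparison itself.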


[0030] Hereafter, the present invention is explained in detail, by clarifying the terms used in the present invention
and general methods for producing the proteins, DNAs, antibodies, and cells of the present invention.
[0031] The "protein or a fragment thereof" means a protein and a fragment thereof derived from a mammal such as
human, bovine, sheep, pig, goat, rabbit, rat, hamster, guinea pig, mouse, and so on, preferably a protein or a fragment
thereof derived from human, rabbit, rat, or mouse, and particularly preferably, a protein or a fragment thereof derived
from human or mouse.

[0032] As a particularly preferred embodiment, it means any protein or a fragment thereof below.

(1) A protein or a fragment thereof comprising the amino acid sequence of SEQ ID NO: 2 or 8.

(2) A protein or a fragment thereof comprising substantially the same amino acid sequence as that of SEQ ID NO:
2 or 8 and having a cytidine deaminase activity.

(3) A mammal-derived protein that comprises an amino acid sequence encoded by a DNA hybridizing under
stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID NO: 1, that is homologous to a protein
comprising the amino acid sequence of SEQ ID NO: 2, and that has a cytidine deaminase activity.

(4) A mammal-derived protein that comprises an amino acid sequence encoded by a DNA hybridizing under
stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID NO: 7, that is homologous to a protein
comprising the amino acid sequence of SEQ ID NO: 8, and that has a cytidine deaminase activity.

[0033] Here, "having substantially the same amino acid sequence" means that a protein has an amino acid sequence
where multiple amino acids, preferably 1 to 10 amino acids, particularly preferably 1 to 5 amino acids, in the amino
acid sequence shown in the references are substituted, deleted, and/or modified, and that a protein has an amino acid
sequence where multiple amino acids, preferably 1 to 10 amino acids, particularly preferably 1 to 5 amino acids, are
added to the amino acid sequence shown in the references.

[0034] The protein of the present invention includes a monomer molecule, a homodimer in which one strand binds to
another strand comprising an identical amino acid sequence, a heterodimer in which one strand binds to another strand
comprising a different amino acid sequence, and oligomers such as a trimer or tetramer.

[0035] Also, the "fragment of a protein" means an arbitrary partial sequence (fragment) in the amino acid sequence
that the above-mentioned AID protein of the present invention comprises. For example, it includes an enzyme active
site required for the AID protein to exert an enzyme activity represented by a cytidine deaminase activity, and an
interaction site required for the AID protein to bind or associate with substrates (e.g. mRNA, etc.) or various auxiliary
proteins.

[0036] Three-letter or single-letter codes used to represent amino acids in the present specification or figures
mean amino acids as follows:

[0037] (Gly/G), glycine; (Ala/A), alanine; (Val/V), valine; (Leu/L), leucine; (Ile/I), isoleucine; (Ser/S), serine; (Thr/T),
threonine; (Asp/D), aspartic acid; (Glu/E), glutamic acid; (Asn/N), asparagine; (Gln/Q), glutamine; (Lys/K), lysine;
(Arg/R), arginine; (Cys/C), cysteine; (Met/M), methionine; (Phe/F), phenylalanine; (Tyr/Y), tyrosine; (Trp/W), tryptophan;
(His/H), histidine; (Pro/P), proline.
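
The code table above can be captured as a simple lookup. The following is a minimal sketch in Python; the dictionary name, the function name, and the hyphen-separated input format are illustrative assumptions and are not part of the specification.

```python
# Three-letter to single-letter amino acid codes, following the list above.
THREE_TO_ONE = {
    "Gly": "G", "Ala": "A", "Val": "V", "Leu": "L", "Ile": "I",
    "Ser": "S", "Thr": "T", "Asp": "D", "Glu": "E", "Asn": "N",
    "Gln": "Q", "Lys": "K", "Arg": "R", "Cys": "C", "Met": "M",
    "Phe": "F", "Tyr": "Y", "Trp": "W", "His": "H", "Pro": "P",
}


def to_single_letter(three_letter_sequence):
    """Convert a hyphen-separated three-letter sequence (assumed input
    format, e.g. "Met-Asp-Ser-Leu") into single-letter notation."""
    return "".join(THREE_TO_ONE[code] for code in three_letter_sequence.split("-"))


print(to_single_letter("Met-Asp-Ser-Leu"))  # prints MDSL
```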

[0038] The proteins and fragments of the present invention can be produced by properly using, in addition to the genetic
engineering techniques mentioned below, methods well known in the art, such as chemical synthesis, cell culture
methods, and so on, or modifications thereof.

[0039] Also, the AID protein of the present invention can be produced as a recombinant fusion protein with another
protein (e.g. GST (Glutathione S-transferase), etc.). In this case, the fusion protein is advantageous in that it can be
extremely easily purified by affinity chromatography employing an adsorbent on which another protein molecule that
binds specifically to GST is immobilized. Moreover, since various antibodies reactive to GST are provided, the quantification
of the fusion protein can be simply carried out by immunoassay (e.g. ELISA, etc.) using the antibodies against GST.
[0040] The DNA of the present invention is a DNA encoding the protein of the present invention and a fragment thereof,
and it includes any nucleotide sequence encoding the protein of the present invention and includes both genomic DNAs
and cDNAs. Also, the DNA includes any DNA composed of any codons as long as the codons encode identical amino
acids.
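
The statement that DNAs composed of different codons are included as long as they encode identical amino acids can be illustrated with the standard genetic code. The sketch below is an illustration only; the two example sequences are hypothetical and are not taken from any SEQ ID NO.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks a termination codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence codon by codon (frame starts at base 1).
    Trailing bases that do not form a full codon are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Two hypothetical DNAs that differ in codon usage but encode the same peptide.
print(translate("ATGGATAGCCTG"))  # prints MDSL
print(translate("ATGGACTCTTTA"))  # prints MDSL
```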

[0041] Also, the DNA of the present invention includes a DNA encoding mammal AID protein, and, as a preferred 
embodiment, a DNA encoding mouse AID protein or human AID protein can be exemplified. 
[0042] Examples of specific embodiments are as follows: 


(1) A DNA encoding a protein comprising the amino acid sequence of SEQ ID NO: 2 or 8. 

(2) The DNA of (1), wherein the protein has a cytidine deaminase activity. 

(3) A DNA comprising the nucleotide sequences of SEQ ID NO: 1 or 7. 

(4) A DNA comprising the nucleotide residues 93 to 689 of SEQ ID NO: 1.

(5) A DNA comprising the nucleotide residues 80 to 676 of SEQ ID NO: 7.

(6) A DNA that hybridizes under stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID
NO: 1 and that encodes a mammal-derived protein being homologous to a protein that comprises the amino acid
sequence of SEQ ID NO: 2 and having a cytidine deaminase activity.

(7) A DNA that hybridizes under stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID
NO: 7 and that encodes a mammal-derived protein being homologous to a protein that comprises the amino acid
sequence of SEQ ID NO: 8 and having a cytidine deaminase activity.

(8) A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one of (a) to (c) below: 

(a) SEQ ID NO: 9, 

(b) SEQ ID NO: 10, or

(c) SEQ ID NO: 35. 

(9) A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one of (a) to (e) below: 

(a) SEQ ID NO: 11,

(b) SEQ ID NO: 12,

(c) SEQ ID NO: 13,

(d) SEQ ID NO: 14, or

(e) SEQ ID NO: 15.


(10) A DNA comprising a complementary nucleotide sequence to an arbitrary partial sequence of a nucleotide 
sequence of any one of (a) to (h) below: 

(a) SEQ ID NO: 9,

(b) SEQ ID NO: 10,

(c) SEQ ID NO: 11,

(d) SEQ ID NO: 12,

(e) SEQ ID NO: 13,

(f) SEQ ID NO: 14,

(g) SEQ ID NO: 15, or

(h) SEQ ID NO: 35.

(11) A DNA comprising a nucleotide sequence of any one of (a) to (q) below:

(a) SEQ ID NO: 18,

(b) SEQ ID NO: 19,

(c) SEQ ID NO: 20,

(d) SEQ ID NO: 21,

(e) SEQ ID NO: 22,

(f) SEQ ID NO: 23,

(g) SEQ ID NO: 24,

(h) SEQ ID NO: 25,

(i) SEQ ID NO: 26,

(j) SEQ ID NO: 27,

(k) SEQ ID NO: 28,

(l) SEQ ID NO: 29,

(m) SEQ ID NO: 30,

(n) SEQ ID NO: 31,

(o) SEQ ID NO: 32,

(p) SEQ ID NO: 33, or

(q) SEQ ID NO: 34.

[0043] Furthermore, a DNA encoding a mutant protein or a fragment thereof obtained by substituting, deleting, and/
or modifying multiple amino acids, preferably 1 to 10 amino acids, particularly preferably 1 to 5 amino acids, or by
inserting multiple amino acids, preferably 1 to 10 amino acids, particularly preferably 1 to 5 amino acids in the amino
acid sequence constituting the above-defined AID protein of the present invention or a fragment thereof is included in
the DNA of the present invention.

[0044] The term "under stringent conditions" used herein means, for example, the following conditions. For example,
in the case of carrying out hybridization using a probe with not less than 50 bases in 0.9% NaCl, the temperature
causing 50% dissociation (Tm) can be calculated from the formula below, and the hybridization temperature can be
set according to the formula below.

Tm = 82.3°C + 0.41 × (G+C)% - 500/n - 0.61 × (formamide)% (n means the number of bases of the probe)
Temperature = Tm - 25°C
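
As a worked illustration, the two formulas above can be evaluated directly. The following Python sketch is not part of the specification; the function names and the example probe (100 bases, 50% G+C) are assumptions chosen for illustration.

```python
def melting_temperature(gc_percent, probe_length, formamide_percent=0.0):
    """Tm in degrees C for a probe of at least 50 bases in 0.9% NaCl:
    Tm = 82.3 + 0.41 x (G+C)% - 500/n - 0.61 x (formamide)%."""
    if probe_length < 50:
        raise ValueError("the formula is stated for probes of not less than 50 bases")
    return 82.3 + 0.41 * gc_percent - 500.0 / probe_length - 0.61 * formamide_percent


def hybridization_temperature(gc_percent, probe_length, formamide_percent=0.0):
    """Hybridization temperature in degrees C, set to Tm - 25."""
    return melting_temperature(gc_percent, probe_length, formamide_percent) - 25.0


# Hypothetical probe: 100 bases, 50% G+C.
print(round(hybridization_temperature(50, 100), 1))        # prints 72.8
print(round(hybridization_temperature(50, 100, 50.0), 1))  # prints 42.3
```

For this hypothetical probe, the hybridization temperature works out to about 72.8°C without formamide and about 42.3°C with 50% formamide.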

[0045] Also, in the case of using a probe with not less than 100 bases (G+C = 40 to 50%), the changes of Tm as (1)
and (2) below can be used as the indicator.

(1) Every 1% mismatch decreases Tm by approximately 1°C.

(2) Every 1% formamide decreases Tm by 0.6 to 0.7°C.
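
The two indicators above can be combined into a rough estimate of how far Tm shifts for a given mismatch level and formamide concentration. The sketch below is illustrative only; the function name and the choice to return a (low, high) range reflecting the 0.6 to 0.7°C span are assumptions.

```python
def adjusted_tm_range(base_tm, mismatch_percent=0.0, formamide_percent=0.0):
    """Return (low, high) estimates of Tm in degrees C after applying the indicators:
    about 1 degree per 1% mismatch and 0.6 to 0.7 degrees per 1% formamide."""
    mismatch_drop = 1.0 * mismatch_percent
    low = base_tm - mismatch_drop - 0.7 * formamide_percent
    high = base_tm - mismatch_drop - 0.6 * formamide_percent
    return low, high


# Hypothetical starting Tm of 90 degrees C, 10% mismatch, 30% formamide.
low, high = adjusted_tm_range(90.0, mismatch_percent=10.0, formamide_percent=30.0)
print(round(low, 1), round(high, 1))  # prints 59.0 62.0
```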


[0046] Thus, the temperature condition in the case of combination of complete complementary strands can be set 
as below. 

(A) 65 to 75°C (without formamide)
(B) 35 to 45°C (with 50% formamide)

[0047] The temperature condition in the case of combination of incomplete complementary strands can be set as 
below. 

(A) 45 to 55°C (without formamide)

(B) 35 to 42°C (with 30% formamide)

[0048] In the case of using probes with not more than 23 bases, the temperature can be 37°C, or the formula below can
also be used as an indicator.

Temperature = 2°C × (number of A+T) + 4°C × (number of C+G) - 5°C
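
The short-probe formula above lends itself to a direct calculation. The following Python sketch is an illustration; the function name and the example oligonucleotide are hypothetical and are not one of the primer sequences of the specification.

```python
def short_probe_temperature(probe):
    """Temperature in degrees C for a probe of not more than 23 bases:
    2 x (number of A+T) + 4 x (number of C+G) - 5."""
    probe = probe.upper()
    if len(probe) > 23:
        raise ValueError("the formula is stated for probes of not more than 23 bases")
    at_count = probe.count("A") + probe.count("T")
    gc_count = probe.count("G") + probe.count("C")
    if at_count + gc_count != len(probe):
        raise ValueError("probe must contain only A, C, G, and T")
    return 2 * at_count + 4 * gc_count - 5


# Hypothetical 20-base probe with 10 A/T and 10 G/C bases.
print(short_probe_temperature("ATGCATGCATGCATGCATGC"))  # prints 55
```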


[0049] The DNA of the present invention can be a DNA obtained by any method. For example, the DNA includes
complementary DNA (cDNA) prepared from mRNA, DNA prepared from genomic DNA, DNA prepared by chemical
synthesis, DNA obtained by PCR amplification with RNA or DNA as a template, and DNA constructed by appropriately
combining these methods.

[0050] The DNA encoding the protein of the present invention can be prepared by the usual methods: cloning cDNA
from mRNA encoding the protein of the present invention, isolating genomic DNA and splicing it, chemical synthesis, 
and so on. 

[0051] (1) cDNA can be cloned from mRNA encoding the protein of the present invention by, for example, the method
described below. 

[0052] First, the mRNA encoding the protein of the present invention is prepared from the above-mentioned tissues
or cells expressing and producing the protein of the present invention. mRNA can be prepared by isolating total RNA
by a known method such as the guanidine-thiocyanate method (Chirgwin et al., Biochemistry, Vol.16, p.5294, 1979), the hot
phenol method, or the AGPC method, and subjecting it to affinity chromatography using oligo-dT cellulose or poly-U
Sepharose.

[0053] Then, with the mRNA obtained as a template, cDNA is synthesized, for example, by a well-known method
using reverse transcriptase, such as the method of Okayama et al. (Mol. Cell. Biol., Vol.2, p.161 (1982); Mol. Cell. Biol.,
Vol.3, p.280 (1983)) or the method of Hoffman et al. (Gene, Vol.25, p.263 (1983)), and converted into double-stranded
cDNA. A cDNA library is prepared by transforming E. coli with plasmid vectors, phage vectors, or cosmid vectors having
this cDNA or by transfecting E. coli after in vitro packaging.

[0054] The plasmid vectors used in this invention are not limited as long as they are replicated and maintained in
hosts. Any phage vector that can be replicated in hosts can also be used. Examples of commonly used cloning vectors
are pUC19, λgt10, λgt11, and so on. When the vector is applied to immunological screening as mentioned below, a
vector having a promoter that can express a gene encoding the desired protein in a host is preferably used.
[0055] cDNA can be inserted into a plasmid by, for example, the method of Maniatis et al. (Molecular Cloning, A
Laboratory Manual, second edition, Cold Spring Harbor Laboratory, p.1.53, 1989). cDNA can be inserted into a phage
vector by, for example, the method of Hyunh et al. (DNA cloning, a practical approach, Vol.1, p.49 (1985)). These
methods can be simply performed by using a commercially available cloning kit (for example, a product from Takara
Shuzo). The recombinant plasmid or phage vector thus obtained is introduced into an appropriate host cell such as a
prokaryote (for example, E. coli: HB101, DH5α, MC1061/P3, etc.).

[0056] Examples of a method for introducing a plasmid into a host are the calcium chloride method, the calcium chloride/
rubidium chloride method, and the electroporation method, described in Molecular Cloning, A Laboratory Manual (second
edition, Cold Spring Harbor Laboratory, p.1.74 (1989)). Phage vectors can be introduced into host cells by, for example,
a method in which the phage DNAs are introduced into grown hosts after in vitro packaging. In vitro packaging can be
easily performed with a commercially available in vitro packaging kit (for example, a product from Stratagene or
Amersham).

[0057] The identification of cDNA encoding a protein whose expression is augmented depending on stimulation
by cytokines, like the AID protein of the present invention, can be carried out by, for example, suppression subtractive
hybridization (SSH) (Proc. Natl. Acad. Sci. USA, Vol.93, p.6025-6030, 1996; Anal. Biochem., Vol.240, p.90-97, 1996) taking
advantage of the suppressive PCR effect (Nucleic Acids Res., Vol.23, p.1087-1088, 1995), using two cDNA libraries,
namely, a cDNA library constructed from mRNA derived from stimulated cells (tester cDNA library) and one constructed from
mRNA derived from unstimulated cells (driver cDNA library).

[0058] The preparation of cDNA libraries required for subtraction cloning can be performed by using a commercially
available kit, for example, PCR-Select Subtraction Kit (CLONTECH, cat: K1804-1). The experiment can be performed
according to the instruction manual accompanying the kit.

[0059] An example of a practical experimental procedure is briefly described below.

[0060] PolyA+ RNA is prepared from cells with or without stimulation with an appropriate stimulant by a previously reported
method (Nucleic Acids Res., Vol.26, No.4, p.911-918, 1998). Next, cDNA is prepared using reverse transcriptase from
each polyA+ RNA sample by a commonly used method. cDNA prepared from stimulated cells is used as tester
cDNA and that prepared from unstimulated cells as driver cDNA.

[0061] According to the previous report mentioned above and the experimental manual accompanying the kit, driver
cDNA is added to tester cDNA to perform subtraction. The efficiency of subtraction is monitored by adding a small amount
of exogenous DNA as a control. After subtraction, the exogenous DNA is concentrated.

[0062] The subtracted cDNA is cloned into an appropriate plasmid expression vector to construct a plasmid library by
a commonly used method.

[0063] Similar to the previous report, many colonies are screened by the differential hybridization method (Nucleic Acids
Res., Vol.26, No.4, p.911-918, 1998; RINSYO-MEN-EKI, Vol.29, No.Suppl.17, p.451-459, 1997). Here, as the hybridization
probes, tester cDNA and driver cDNA mentioned above labeled with radioisotope can be used. Clones containing
the objective DNA and those containing exogenous DNA can be distinguished by hybridizing the exogenous DNA
with replica filters.

[0064] Objective cDNA or its fragment can be obtained by selecting clones giving strong signal against radiolabeled 
tester cDNA probe rather than radiolabeled driver cDNA probe. 
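
The selection step in differential hybridization can be thought of as a comparison of two signal intensities per clone. The sketch below is only an illustration of that idea; the clone names, the signal values, and the 3-fold cutoff are hypothetical assumptions, since the specification does not state a numerical threshold.

```python
def select_candidate_clones(signals, min_ratio=3.0):
    """Return the clones whose tester-probe signal exceeds the driver-probe
    signal by more than min_ratio-fold.

    signals maps a clone name to (tester_signal, driver_signal).
    min_ratio is an assumed, illustrative cutoff."""
    selected = []
    for clone, (tester_signal, driver_signal) in signals.items():
        if tester_signal > min_ratio * driver_signal:
            selected.append(clone)
    return selected


# Hypothetical signal intensities in arbitrary units.
example_signals = {
    "clone_1": (900.0, 100.0),  # strong with tester probe only
    "clone_2": (500.0, 450.0),  # similar with both probes
    "clone_3": (300.0, 0.0),    # detected with tester probe only
}
print(select_candidate_clones(example_signals))  # prints ['clone_1', 'clone_3']
```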

[0065] Also, cloning of cDNA encoding the protein of the present invention can be accomplished by other general cDNA
screening methods.

[0066] For instance, cDNA or its fragment encoding the protein of the present invention cloned by the subtraction cloning
method mentioned above, or chemically synthesized oligonucleotides corresponding to the amino acid sequence of the
protein of the present invention, are labeled to make probes, and commercial or originally prepared cDNA libraries can
then be screened by the well-known colony hybridization method (Grunstein et al., Proc. Natl. Acad. Sci. USA, Vol.72,
p.3961, 1975) or plaque hybridization method (Molecular Cloning, A Laboratory Manual, second edition, Cold Spring
Harbor Laboratory, p.2.108, 1989). Furthermore, a method of amplifying DNA including cDNA encoding the protein of the
present invention by PCR, using a pair of PCR primers constructed based on cDNA or its fragment encoding the protein
of the present invention isolated by the subtraction cloning mentioned above, can be listed.

[0067] When a cDNA library prepared using a cDNA expression vector is used, the desired clone can be screened
by the antigen-antibody reaction using an antibody against the desired protein. A screening method using the PCR method
is preferably used when many clones are subjected to screening.

[0068] The nucleotide sequence of the DNA thus obtained can be determined by the Maxam-Gilbert method (Maxam et
al., Proc. Natl. Acad. Sci. USA, Vol.74, p.560 (1977)) or the dideoxynucleotide synthetic chain termination method using
phage M13 (Sanger et al., Proc. Natl. Acad. Sci. USA, Vol.74, pp.5463-5467 (1977)). The nucleotide sequence can be
easily determined using a commercial DNA sequencer.

[0069] The whole or a part of the gene encoding the protein of the present invention can be obtained by excising the 
clone obtained as mentioned above with restriction enzymes and so on. 

[0070] (2) Also, the DNA encoding the protein of the present invention can be isolated from the genomic DNA derived
from the cells expressing the protein of the present invention as mentioned above by the following methods.

[0071] Such cells are solubilized preferably by SDS or proteinase K, and the DNAs are deproteinized by repeating
phenol extraction. RNAs are digested preferably with ribonuclease. The DNAs obtained are partially digested with
appropriate restriction enzymes, and the DNA fragments obtained are amplified with appropriate phage or cosmid to
generate a library. Then, clones having the desired sequence are detected, for example, by using radioactively labeled
DNA probes, and the whole or a portion of the gene encoding the protein of the present invention is obtained from the
clones by excision with restriction enzymes, etc.

[0072] For example, cDNA encoding a human-derived protein can be obtained by preparing a cosmid library into
which human genomic DNAs (chromosomal DNAs) are introduced ("Laboratory Manual Human Genome Mapping,"
M. Hori and Y. Nakamura, eds., Maruzen), screening the cosmid library to obtain positive clones containing DNA
corresponding to the coding region of the desired protein, and screening the above cDNA library using the coding region
DNA excised from the positive clones as a probe.

[0073] Also, the present invention relates to any fragment of DNA (cDNA, genomic DNA, etc.) encoding AID protein
(especially human AID protein) of the present invention described above. DNA with a complementary nucleotide
sequence to any nucleotide sequence of cDNA or genomic DNA is useful as a primer DNA in polymerase chain reaction
(PCR). By PCR using a pair of the primer DNAs, any partial nucleotide sequence of genomic DNA encoding AID protein
(especially human AID protein) of the present invention can be amplified.
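
Because primer DNAs are defined here by complementarity to a target sequence, a short sketch of how a primer pair flanking a region could be derived may be helpful. The code below is an illustration under simple assumptions (fixed primer length, no check of melting temperature or specificity); the function names and the example template are hypothetical and are not sequences of the present invention.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.translate(COMPLEMENT)[::-1]


def primer_pair(template, start, end, primer_length=20):
    """Derive a forward and a reverse primer flanking template[start:end].

    The forward primer copies the first primer_length bases of the region;
    the reverse primer is the reverse complement of its last primer_length bases."""
    region = template[start:end]
    if len(region) < primer_length:
        raise ValueError("region is shorter than the requested primer length")
    forward = region[:primer_length]
    reverse = reverse_complement(region[-primer_length:])
    return forward, reverse


# Hypothetical 12-base template and 4-base primers, for illustration only.
print(reverse_complement("ATGGACAGC"))           # prints GCTGTCCAT
print(primer_pair("ATGGACAGCCTC", 0, 12, 4))     # prints ('ATGG', 'GAGG')
```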

[0074] For instance, in the case that a mutation or deletion of genomic DNA (especially an exon) encoding the AID protein
is presumed to cause a certain immunodeficiency or allergy, the existence of such a mutation or deletion can be
analyzed by PCR as described below.

(1) Prepare a pair of primers comprising a complementary nucleotide sequence to any partial nucleotide sequence
of genomic DNA encoding the AID protein of the present invention.

(2) Amplify the objective partial nucleotide sequence of the genomic DNA using the pair of primers, using genomic
DNA encoding AID protein obtained from tissue or cells of immunodeficiency or allergy patients as templates.

(3) Analyze the existence of PCR products and the nucleotide sequence of the PCR products, and identify the
mutation and deletion in the genomic DNA by comparing the nucleotide sequence with the corresponding nucleotide
sequence of genomic DNA encoding AID protein derived from a normal human.
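
The comparison in step (3) can be sketched as a position-by-position check of the sequenced PCR product against the normal sequence. The Python code below is a simplified illustration: it reports substitutions only when the two sequences have equal length and otherwise reports the length difference as a sign of a possible insertion or deletion. The function name and the example sequences are hypothetical; a real analysis would use a sequence alignment tool.

```python
def compare_to_reference(reference, sample):
    """Compare a sample sequence with a reference sequence of the same region.

    Returns a dict with:
      length_difference: len(sample) - len(reference)
      substitutions: list of (1-based position, reference base, sample base),
                     or None when the lengths differ."""
    reference = reference.upper()
    sample = sample.upper()
    if len(reference) != len(sample):
        return {"length_difference": len(sample) - len(reference), "substitutions": None}
    substitutions = [
        (position + 1, ref_base, sample_base)
        for position, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]
    return {"length_difference": 0, "substitutions": substitutions}


# Hypothetical normal and patient-derived sequences.
print(compare_to_reference("ATGGACAGC", "ATGGATAGC"))
# prints {'length_difference': 0, 'substitutions': [(6, 'C', 'T')]}
print(compare_to_reference("ATGGACAGC", "ATGGAGC"))
# prints {'length_difference': -2, 'substitutions': None}
```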

[0075] Thus, the method described above can not only elucidate, for example, the relation between immunodeficiency
and/or allergy and AID protein, but also be used for the diagnosis of a certain kind of disease, in the case that AID
protein is the cause of the disease.

[0076] Examples of the nucleotide sequence of the primer DNA are as follows:

(1) A DNA comprising a complementary nucleotide sequence to an arbitrary partial sequence of a nucleotide
sequence of any one of (a) to (h) below:

(a) SEQ ID NO: 9,

(b) SEQ ID NO: 10,

(c) SEQ ID NO: 11,

(d) SEQ ID NO: 12,

(e) SEQ ID NO: 13,

(f) SEQ ID NO: 14,

(g) SEQ ID NO: 15, or

(h) SEQ ID NO: 35.

(2) A DNA comprising a nucleotide sequence of any one of (a) to (q) below: 

(a) SEQ ID NO: 18,

(b) SEQ ID NO: 19,

(c) SEQ ID NO: 20,

(d) SEQ ID NO: 21,

(e) SEQ ID NO: 22,

(f) SEQ ID NO: 23,

(g) SEQ ID NO: 24,

(h) SEQ ID NO: 25,

(i) SEQ ID NO: 26,

(j) SEQ ID NO: 27,

(k) SEQ ID NO: 28,

(l) SEQ ID NO: 29,

(m) SEQ ID NO: 30,

(n) SEQ ID NO: 31,

(o) SEQ ID NO: 32,

(p) SEQ ID NO: 33, or

(q) SEQ ID NO: 34.

[0077] Also, the present invention relates to the use of the above-mentioned DNA fragment as a primer DNA in 
polymerase chain reaction. 

[0078] Examples of the combination of primer DNAs for PCR in diagnosis accomplished by PCR gene amplification
and by analyzing the amplified product are as follows:

(1) a DNA comprising the nucleotide sequence of SEQ ID NO: 31 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 32,

(2) a DNA comprising the nucleotide sequence of SEQ ID NO: 20 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 22,

(3) a DNA comprising the nucleotide sequence of SEQ ID NO: 21 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 30,

(4) a DNA comprising the nucleotide sequence of SEQ ID NO: 24 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 25,

(5) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 27,

(6) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 28,

(7) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 29,

(8) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 27,

(9) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 28,

(10) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 29,

(11) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 28,

(12) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 29,

(13) a DNA comprising the nucleotide sequence of SEQ ID NO: 33 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 29, or

(14) a DNA comprising the nucleotide sequence of SEQ ID NO: 18 and a DNA comprising the nucleotide sequence
of SEQ ID NO: 19.

[0079] Moreover, the present invention also relates to a recombinant vector comprising the DNA encoding the protein
of the present invention. As a recombinant vector of the present invention, any vector can be used as long as it is
capable of retaining replication or self-multiplication in each host cell of prokaryotic and/or eukaryotic cells, including
plasmid vectors and phage vectors.

[0080] The recombinant vector can easily be prepared by ligating the DNA encoding the protein of the present invention
with a vector for recombination available in the art (plasmid DNA and bacteriophage DNA) by the usual method.
[0081] Specific examples of the vectors for recombination used are E. coli-derived plasmids such as pBR322,
pBR325, pUC12, pUC13, and pUC19, yeast-derived plasmids such as pSH19 and pSH15, and Bacillus subtilis-derived
plasmids such as pUB110, pTP5, and pC194. Examples of phages are a bacteriophage such as λ phage, and an
animal or insect virus (pVL1393, Invitrogen) such as a retrovirus, vaccinia virus, and nuclear polyhedrosis virus.
[0082] An expression vector is useful for expressing the DNA encoding the protein of the present invention and for
producing the protein of the present invention. The expression vector is not limited as long as it expresses the gene
encoding the protein of the present invention in various prokaryotic and/or eukaryotic host cells and produces this
protein. Examples thereof are pMAL C2, pEF-BOS (Nucleic Acids Res., Vol.18, p.5322 (1990) and so on), pME18S
(Experimental Medicine: SUPPLEMENT, "Handbook of Genetic Engineering" (1992) and so on), etc.
[0083] Also, the protein of the present invention can be produced as a fusion protein with another protein. It can be
prepared as a fusion protein, for example, with GST (Glutathione S-transferase) by subcloning a cDNA encoding the
protein of the present invention, for example, into plasmid pGEX4T1 (Pharmacia), by transforming E. coli DH5α, and
by culturing the transformant.

[0084] When bacteria, particularly E. coli, are used as host cells, an expression vector generally comprises, at least,
a promoter/operator region, an initiation codon, the DNA encoding the protein of the present invention, a termination
codon, a terminator region, and a replicon.

[0085] When yeast, animal cells, or insect cells are used as hosts, an expression vector preferably comprises, at
least, a promoter, an initiation codon, the DNA encoding the protein of the present invention, and a termination codon.
It may also comprise the DNA encoding a signal peptide, enhancer sequence, 5'- and 3'-untranslated region of the
gene encoding the protein of the present invention, splicing junctions, polyadenylation site, selectable marker region,
and replicon. The expression vector may also contain, if required, a gene for gene amplification (marker) that is usually
used.

[0086] A promoter/operator region to express the protein of the present invention in bacteria comprises a promoter,
an operator, and a Shine-Dalgarno (SD) sequence (for example, AAGG). For example, when the host is Escherichia,
it preferably comprises the trp promoter, lac promoter, recA promoter, λPL promoter, lpp promoter, tac promoter, or the
like. Examples of a promoter to express the protein of the present invention in yeast are the PHO5 promoter, PGK promoter,
GAP promoter, ADH promoter, and so on. When the host is Bacillus, examples thereof are the SPO1 promoter, SPO2
promoter, penP promoter, and so on. When the host is a eukaryotic cell such as a mammalian cell, examples thereof
are an SV40-derived promoter, a retrovirus promoter, a heat shock promoter, and so on, and preferably an SV40-derived
or retrovirus-derived one. As a matter of course, the promoter is not limited to the above examples. In addition, using
an enhancer is effective for expression.

[0087] A preferable initiation codon is, for example, a methionine codon (ATG).

[0088] A commonly used termination codon (for example, TAG, TAA, TGA) is exemplified as a termination codon.
[0089] Commonly used natural or synthetic terminators are used as a terminator region.
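
The initiation and termination codon requirements described above can be checked on a candidate coding sequence before it is placed in an expression vector. The following Python sketch is only an illustration; the function name, the returned keys, and the example sequences are hypothetical.

```python
STOP_CODONS = {"TAG", "TAA", "TGA"}


def check_coding_sequence(dna):
    """Report basic features of a coding sequence read in frame from base 1:
    length divisible by three, ATG initiation codon, terminal termination codon,
    and whether a termination codon occurs before the last codon."""
    dna = dna.upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    return {
        "length_multiple_of_three": len(dna) % 3 == 0,
        "starts_with_atg": dna.startswith("ATG"),
        "ends_with_stop": len(dna) >= 3 and dna[-3:] in STOP_CODONS,
        "internal_stop": any(codon in STOP_CODONS for codon in codons[:-1]),
    }


# Hypothetical short coding sequences.
print(check_coding_sequence("ATGGACAGCTGA"))
# prints {'length_multiple_of_three': True, 'starts_with_atg': True, 'ends_with_stop': True, 'internal_stop': False}
print(check_coding_sequence("ATGTAAGACTGA")["internal_stop"])  # prints True
```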

[0090] A replicon means a DNA capable of replicating the whole DNA sequence in host cells, and includes a natural
plasmid, an artificially modified plasmid (DNA fragment prepared from a natural plasmid), a synthetic plasmid, and so
on. Examples of preferable plasmids are pBR322 or its artificial derivatives (DNA fragment obtained by treating pBR322
with appropriate restriction enzymes) for E. coli, yeast 2μ plasmid or yeast chromosomal DNA for yeast, and pRSVneo
ATCC 37198, pSV2dhfr ATCC 37145, pdBPV-MMTneo ATCC 37224, pSV2neo ATCC 37149, and such for mammalian
cells.
[0091] An enhancer sequence, polyadenylation site, and splicing junction that are usually used in the art, such as
those derived from SV40, can also be used.

[0092] A selectable marker usually employed can be used according to the usual method. Examples thereof are 
resistance genes for antibiotics, such as tetracycline, ampicillin, or kanamycin. 
[0093] Examples of genes for gene amplification are dihydrofolate reductase (DHFR) gene, thymidine kinase gene,
neomycin resistance gene, glutamate synthase gene, adenosine deaminase gene, ornithine decarboxylase gene,
hygromycin-B-phosphotransferase gene, aspartate transcarbamylase gene, etc.

[0094] The expression vector of the present invention can be prepared by continuously and circularly linking at least
the above-mentioned promoter, initiation codon, DNA encoding the protein of the present invention, termination codon,
and terminator region, to an appropriate replicon. If desired, appropriate DNA fragments (for example, linkers, restriction
sites, and so on) can be used by the usual method such as digestion with a restriction enzyme or ligation using T4
DNA ligase.

[0095] Transformants of the present invention can be prepared by introducing the expression vector mentioned above
into host cells.

[0096] Host cells used in the present invention are not limited as long as they are compatible with an expression
vector mentioned above and can be transformed. Examples thereof are various cells such as wild-type cells or artificially
established recombinant cells usually used in the technical field of the present invention (for example, bacteria (Escherichia
and Bacillus), yeast (Saccharomyces, Pichia, and such), animal cells, or insect cells).

[0097] E. coli or animal cells are preferably used. Specific examples are E. coli (DH5α, TB1, HB101, and such),
mouse-derived cells (COP, L, C127, Sp2/0, NS-1, NIH 3T3, and such), rat-derived cells (PC12, PC12h), hamster-derived
cells (BHK, CHO, and such), monkey-derived cells (COS1, COS3, COS7, CV1, Vero, and such), and human-derived
cells (HeLa, diploid fibroblast-derived cells, myeloma cells, HepG2, and such).
[0098] An expression vector can be introduced (transformed (transfected)) into host cells by known methods. 
[0099] Transformation can be performed, for example, according to the method of Cohen et al. (Proc. Natl. Acad.
Sci. USA, Vol.69, p.2110 (1972)), the protoplast method (Mol. Gen. Genet., Vol.168, p.111 (1979)), or the competent method
(J. Mol. Biol., Vol.56, p.209 (1971)) when the hosts are bacteria (E. coli, Bacillus subtilis, and such), the method of
Hinnen et al. (Proc. Natl. Acad. Sci. USA, Vol.75, p.1927 (1978)) or the lithium method (J. Bacteriol., Vol.153, p.163 (1983))
when the host is Saccharomyces cerevisiae, the method of Graham (Virology, Vol.52, p.456 (1973)) when the hosts
are animal cells, and the method of Summers et al. (Mol. Cell. Biol., Vol.3, pp.2156-2165 (1983)) when the hosts are
insect cells.

[0100] The protein of the present invention can be produced by cultivating transformants (in the following, this term
includes transfectants) comprising an expression vector prepared as mentioned above in nutrient media.
[0101] The nutrient media preferably comprise carbon source, inorganic nitrogen source, or organic nitrogen source
necessary for the growth of host cells (transformants). Examples of the carbon source are glucose, dextran, soluble
starch, and sucrose, and examples of the inorganic or organic nitrogen source are ammonium salts, nitrates, amino
acids, corn steep liquor, peptone, casein, meat extract, soy bean cake, and potato extract. If desired, they may comprise
other nutrients (for example, an inorganic salt (for example, calcium chloride, sodium dihydrogenphosphate, and
magnesium chloride), vitamins, and antibiotics (for example, tetracycline, neomycin, ampicillin, kanamycin, and so on)).
[0102] Cultivation is performed by a method known in the art. Cultivation conditions such as temperature, pH of the
media, and cultivation time are selected appropriately so that the protein of the present invention is produced in large
quantities.

[0103] Specific media and cultivation conditions used depending on host cells are illustrated below, but are not limited
thereto.

[0104] When the hosts are bacteria, actinomycetes, yeasts, or filamentous fungi, liquid media comprising the nutrient
source mentioned above are appropriate. Media with pH 5 to 8 are preferably used.

[0105] When the host is E. coli, examples of preferable media are LB media, M9 media (Miller et al., Exp. Mol. Genet.,
Cold Spring Harbor Laboratory, p.431 (1972)), and so on. Using these media, cultivation can be performed usually at
14 to 43°C for about 3 to 24 hours with aeration and stirring, if necessary.

[0106] When the host is Bacillus, cultivation can be performed usually at 30 to 40°C for about 16 to 96 hours with
aeration and stirring, if necessary.

[0107] When the host is yeast, an example of media is Burkholder minimal media (Bostian, Proc. Natl. Acad. Sci.
USA, Vol.77, p.4505 (1980)). The pH of the media is preferably 5 to 8. Cultivation can be performed usually at 20 to
35°C for about 14 to 144 hours with aeration and stirring, if necessary.

[0108] When the host is an animal cell, examples of media are MEM media containing about 5 to 20% fetal bovine
serum (Science, Vol.122, p.501 (1952)), DMEM media (Virology, Vol.8, p.396 (1959)), RPMI1640 media (J. Am. Med.
Assoc., Vol.199, p.519 (1967)), 199 media (Proc. Soc. Exp. Biol. Med., Vol.73, p.1 (1950)), and so on. The pH of the
media is preferably about 6 to 8. Cultivation can be performed usually at about 30 to 40°C for about 15 to 72 hours
with aeration and stirring, if necessary.






[0109] When the host is an insect cell, an example of media is Grace's media containing fetal bovine serum (Proc.
Natl. Acad. Sci. USA, Vol.82, p.8404 (1985)). The pH thereof is preferably about 5 to 8. Cultivation can be performed
usually at about 20 to 40°C for 15 to 100 hours with aeration and stirring, if necessary.
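
The host-specific cultivation ranges given above can be collected into a small lookup table. The following Python sketch is only an illustration: the class name, the dictionary keys, and the function `within_usual_range` are hypothetical, and the pH range for E. coli and Bacillus is taken from the general statement for bacteria (pH 5 to 8) rather than from a host-specific sentence.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class CultureRange:
    """Usual cultivation window for one host type (inclusive bounds)."""
    temp_c: Tuple[float, float]
    hours: Tuple[float, float]
    ph: Optional[Tuple[float, float]] = None


# Ranges transcribed from the paragraphs on E. coli, Bacillus, yeast,
# animal cells, and insect cells. The keys are illustrative labels only.
CONDITIONS = {
    "E. coli": CultureRange((14, 43), (3, 24), (5, 8)),
    "Bacillus": CultureRange((30, 40), (16, 96), (5, 8)),
    "yeast": CultureRange((20, 35), (14, 144), (5, 8)),
    "animal cell": CultureRange((30, 40), (15, 72), (6, 8)),
    "insect cell": CultureRange((20, 40), (15, 100), (5, 8)),
}


def within_usual_range(host, temp_c, hours, ph=None):
    """Return True if the planned condition lies inside the usual window."""
    r = CONDITIONS[host]
    ok = r.temp_c[0] <= temp_c <= r.temp_c[1] and r.hours[0] <= hours <= r.hours[1]
    if ph is not None and r.ph is not None:
        ok = ok and r.ph[0] <= ph <= r.ph[1]
    return ok
```

For instance, `within_usual_range("animal cell", 37, 48, ph=7.2)` evaluates to True, whereas 37°C lies outside the usual window stated for yeast. The ranges are described in the text as usual values, not as limits.
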

[0110] The protein of the present invention can be produced by cultivating transformants, especially mammalian
cells, as mentioned above and allowing them to secrete the protein into the culture supernatant.

[0111] A culture filtrate (supernatant) is obtained by a method such as filtration or centrifugation of the obtained
culture, and the protein of the present invention is purified and isolated from the culture filtrate by methods commonly
used in order to purify and isolate a natural or synthetic protein.

[0112] Examples of the isolation and purification method are a method utilizing solubility, such as salting out and
solvent precipitation; a method utilizing the difference in molecular weight, such as dialysis, ultrafiltration, gel
filtration, and sodium dodecyl sulfate-polyacrylamide gel electrophoresis; a method utilizing charges, such as ion
exchange chromatography and hydroxylapatite chromatography; a method utilizing specific affinity, such as affinity
column chromatography; a method utilizing the difference in hydrophobicity, such as reverse phase high performance
liquid chromatography; and a method utilizing the difference in isoelectric point, such as isoelectric focusing.
[0113] When the protein of the present invention exists in the periplasm or cytoplasm of cultured transformants (for
example, E. coli), first, the cells are harvested by a usual method such as filtration or centrifugation and suspended
in an appropriate buffer. After the cell wall and/or cell membrane of the cells is disrupted by a method such
as lysis with sonication, lysozyme, or freeze-thawing, the membrane fraction comprising the protein of the present
invention is obtained by a method such as centrifugation or filtration. The membrane fraction is solubilized with a
detergent such as Triton-X100 to obtain the crude extract. Finally, the protein is isolated and purified from the crude
extract by the usual method as illustrated above.

[0114] By using a DNA (cDNA or genomic DNA) encoding a human-derived AID protein included in the protein of
the present invention, transgenic non-human mammals secreting the human AID protein in their body can be prepared.
Namely, by integrating the human-derived DNA into an endogenous locus of non-human mammals (e.g. mouse), the
human AID protein of the present invention encoded by the DNA is expressed and secreted in their body. The transgenic
non-human mammals are included in the present invention.

[0115] The transgenic non-human mammals can be prepared according to the method usually used for producing
a transgenic animal (for example, see "Newest Manual of Animal Cell Experiment", LIC Press, Chapter 7, pp.361-408
(1990)).

[0116] Specifically, for example, a transgenic mouse can be produced as follows. Embryonic stem cells (ES cells)
obtained from normal mouse blastocysts are transformed with an expression vector in which the gene encoding the
human AID protein of the present invention and a marker gene (for example, neomycin resistance gene) have been
inserted in an expressible manner. ES cells in which the gene encoding the human AID protein of the present invention
has been integrated into the endogenous gene are screened by a usual method based on expression of the marker
gene. Then, the ES cells screened are microinjected into a fertilized egg (blastocyst) obtained from another normal
mouse (Proc. Natl. Acad. Sci. USA, Vol.77, No.12, pp.7380-7384 (1980); U.S. Pat. No. 4,873,191).
[0117] The blastocyst is transplanted into the uterus of another normal mouse as the foster mother. Then, founder
mice are born from the foster mother. By mating the founder mice with normal mice, heterozygous transgenic mice are
obtained. By mating the heterozygous transgenic mice with each other, homozygous transgenic mice are obtained
according to Mendel's laws.
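
The expected outcome of mating heterozygous transgenic mice with each other can be worked through with a short enumeration. The Python sketch below assumes, purely for illustration, a single autosomal integration site with a transgene allele written "T" and a wild-type allele written "w"; the function name `cross` and the allele symbols are hypothetical and do not appear in the source.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict:
    """Expected genotype fractions for a single-locus cross.

    Each parent is a two-character genotype such as "Tw". Every
    combination of one allele from each parent is equally likely.
    """
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Heterozygote x heterozygote: one quarter of the offspring are expected
# to be homozygous for the transgene ("TT").
offspring = cross("Tw", "Tw")
```

Under these assumptions `offspring` contains "TT", "Tw", and "ww" in the ratio 1:2:1, so about one quarter of the progeny of a heterozygous cross are expected to be homozygous transgenic mice.
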

[0118] Also, a so-called "knockout mouse" can be generated based on the nucleotide sequence of DNA encoding
mouse AID protein included in the present invention. The "knockout mouse" in the present invention means a mouse
in which the endogenous gene encoding the mouse AID protein of the present invention is knocked out (inactivated).
For example, it can be generated by a positive-negative selection method applying homologous recombination (U.S.
Pat. No. 5,464,764; 5,487,992; 5,627,059; Proc. Natl. Acad. Sci. USA, Vol.86, 8932-8935, 1989; Nature, Vol.342,
435-438, 1989; etc.), and such knockout mice are one embodiment of the present invention.
[0119] The "antibody" in the present invention means a polyclonal antibody (antiserum) or a monoclonal antibody, 
and preferably a monoclonal antibody. 

[0120] Specifically, it includes an antibody reactive to the above-mentioned protein of the present invention and a
fragment thereof.

[0121] The "antibody" of the present invention also includes a natural antibody that can be prepared by immunizing
mammals such as mice, rats, hamsters, guinea pigs, or rabbits with the protein of the present invention (including
natural, recombinant, and chemically synthesized protein and cell), a fragment thereof, or a transformant highly
expressing the protein of interest by recombinant technology mentioned above; a chimeric antibody and a humanized
antibody (CDR-grafted antibody) that can be produced by recombinant technology; and a human monoclonal antibody
that can be produced by using human antibody-producing transgenic animals.

[0122] The monoclonal antibody includes those having any one of the isotypes of IgG, IgM, IgA, IgD, or IgE. IgG or
IgM is preferable.

[0123] The polyclonal antibody (antiserum) or monoclonal antibody of the present invention can be produced by
known methods. Namely, mammals, preferably mice, rats, hamsters, guinea pigs, rabbits, cats, dogs, pigs, goats,
horses, or bovines, or more preferably mice, rats, hamsters, guinea pigs, or rabbits, are immunized, for example, with
an antigen mentioned above with Freund's adjuvant, if necessary. The polyclonal antibody can be obtained from the
serum obtained from the animal so immunized. The monoclonal antibodies are produced as follows. Hybridomas are
produced by fusing the antibody-producing cells obtained from the animal so immunized and myeloma cells incapable
of producing autoantibodies. Then the hybridomas are cloned, and clones producing the monoclonal antibodies
showing the specific affinity to the antigen used for immunizing the mammal are screened.

[0124] Specifically, the monoclonal antibody can be produced as follows. Immunizations are done by injecting or
implanting once or several times the above-mentioned protein of the present invention, a fragment thereof, the cells
that express the protein, and so on as an immunogen, if necessary with Freund's adjuvant, subcutaneously,
intramuscularly, intravenously, through the footpad, or intraperitoneally into mice, rats, hamsters, guinea pigs, or rabbits,
preferably mice, rats, or hamsters (including transgenic animals generated so as to produce antibodies derived from another
animal, such as the transgenic mouse producing human antibody). Usually, immunizations are performed once to four
times every one to fourteen days after the first immunization. Antibody-producing cells are obtained from the mammal
so immunized in about one to five days after the last immunization.

[0125] Hybridomas that secrete a monoclonal antibody can be prepared by the method of Kohler and Milstein (Nature,
Vol.256, pp.495-497 (1975)) and by its modified method. Namely, hybridomas are prepared by fusing
antibody-producing cells contained in a spleen, lymph node, bone marrow, or tonsil obtained from the non-human mammal
immunized as mentioned above, preferably a spleen, with myeloma cells without autoantibody-producing ability, which are
derived from, preferably, a mammal such as mice, rats, guinea pigs, hamsters, rabbits, or humans, or more preferably,
mice, rats, or humans.

[0126] For example, mouse-derived myeloma P3/X63-AG8.653 (653; ATCC No. CRL1580), P3/NSI/1-Ag4-1 (NS-1),
P3/X63-Ag8.U1 (P3U1), SP2/0-Ag14 (Sp2/0, Sp2), PAI, FO, or BW5147; rat-derived myeloma 210RCY3-Ag.2.3.;
or human-derived myeloma U-266AR1, GM1500-6TG-A1-2, UC729-6, CEM-AGR, D1R11, or CEM-T15 can be used
as a myeloma used for the cell fusion.

[0127] Hybridoma clones producing monoclonal antibodies can be screened by cultivating the hybridomas, for
example, in microtiter plates and by measuring the reactivity of the culture supernatant in the well in which hybridoma
growth is observed, to the immunogen used for the immunization mentioned above, for example, by an enzyme
immunoassay such as RIA and ELISA.

[0128] The monoclonal antibodies can be produced from hybridomas by cultivating the hybridomas in vitro or in vivo,
such as in the ascites of mice, rats, guinea pigs, hamsters, or rabbits, preferably mice or rats, more preferably mice,
and isolating the antibodies from the resulting culture supernatant or ascites fluid of a mammal.
[0129] In vitro cultivation can be performed depending on the property of cells to be cultured, on the object of a test
study, and on the various culture conditions, by using known nutrient media or any nutrient media derived from known basal media
for growing, maintaining, and storing the hybridomas to produce monoclonal antibodies in the culture supernatant.
[0130] Examples of basal media are low calcium concentration media such as Ham's F12 medium, MCDB153 medium,
or low calcium concentration MEM medium, and high calcium concentration media such as MCDB104 medium, MEM
medium, D-MEM medium, RPMI1640 medium, ASF104 medium, or RD medium. The basal media can contain, for
example, sera, hormones, cytokines, and/or various inorganic or organic substances depending on the objective.
[0131] Monoclonal antibodies can be isolated and purified from the culture supernatant or ascites mentioned above
by saturated ammonium sulfate precipitation, euglobulin precipitation method, caproic acid method, caprylic acid
method, ion exchange chromatography (DEAE or DE52), or affinity chromatography using an anti-immunoglobulin column or
protein A column.

[0132] Furthermore, monoclonal antibodies can be obtained in a large quantity by cloning a gene encoding a
monoclonal antibody from the hybridoma, generating transgenic bovines, goats, sheep, or pigs in which the gene encoding
the antibody is integrated in its endogenous gene using transgenic animal generating technique, and recovering the
monoclonal antibody derived from the antibody gene from milk of the transgenic animals (Nikkei Science, No.4,
pp.78-84 (1997)).

[0133] The "chimeric antibody" of the present invention means a monoclonal antibody prepared by genetic
engineering, and specifically, a chimeric monoclonal antibody, for example, mouse/human chimeric antibody, whose variable
region is a mouse immunoglobulin-derived variable region and whose constant region is a human
immunoglobulin-derived constant region.

[0134] The constant region derived from human immunoglobulin has the amino acid sequence inherent in each
isotype such as IgG, IgM, IgA, IgD, IgE, etc. The constant region of the recombinant chimeric monoclonal antibody of
the present invention can be that of human immunoglobulin belonging to any isotype. Preferably, it is the constant
region of human IgG.

[0135] The chimeric monoclonal antibody of the present invention can be produced, for example, as follows. Needless
to say, the production method is not limited thereto.

[0136] For example, mouse/human chimeric monoclonal antibody can be prepared by referring to Experimental
Medicine: SUPPLEMENT, Vol.1.6, No.10 (1988); and Examined Published Japanese Patent Application (JP-B) No.
Hei 3-73280. Namely, it can be prepared by ligating the CH gene (C gene encoding the constant region of H chain) obtained
from the DNA encoding human immunoglobulin downstream of the active VH genes (rearranged VDJ gene encoding
the variable region of H chain) obtained from the DNA encoding mouse monoclonal antibody isolated from the
hybridoma producing the mouse monoclonal antibody, and by ligating the CL gene (C gene encoding the constant region
of L chain) obtained from the DNA encoding human immunoglobulin downstream of the active VL genes (rearranged
VJ gene encoding the variable region of L chain) obtained from the DNA encoding the mouse monoclonal antibody
isolated from the hybridoma, and operably inserting those into the same or different vectors in an expressible manner,
followed by transformation of host cells with the expression vector, and cultivation of the transformants.
[0137] Specifically, DNAs are first extracted from mouse monoclonal antibody-producing hybridoma by the usual
method, digested with appropriate restriction enzymes (for example, EcoRI and HindIII), electrophoresed (using, for
example, 0.7% agarose gel), and analyzed by Southern blotting. After the electrophoresed gel is stained, for example,
with ethidium bromide and photographed, the gel is given marker positions, washed twice with water, and soaked in
0.25 M HCl for 15 minutes. Then, the gel is soaked in 0.4 N NaOH solution for 10 minutes with gentle stirring. The
DNAs are transferred to a filter for 4 hours following the usual method. The filter is recovered and washed twice with
2 x SSC. After the filter is sufficiently dried, it is baked at 75°C for 3 hours, treated with 0.1 x SSC/0.1% SDS at 65°C
for 30 minutes, and then soaked in 3 x SSC/0.1% SDS. The filter obtained is treated with prehybridization solution in
a plastic bag at 65°C for 3 to 4 hours.

[0138] Next, 32P-labeled probe DNA and hybridization solution are added to the bag and reacted at 65°C for about 12
hours. After hybridization, the filter is washed under an appropriate salt concentration, reaction temperature, and time
(for example, 2 x SSC/0.1% SDS, room temperature, 10 minutes). The filter is put into a plastic bag with a little 2 x
SSC, and subjected to autoradiography after the bag is sealed.
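
The wash and soak solutions named above (2 x SSC, 0.1 x SSC/0.1% SDS, 3 x SSC/0.1% SDS) are ordinarily made up by diluting concentrated stocks. The Python sketch below applies the usual dilution relation C1·V1 = C2·V2. The stock concentrations (20 x SSC and 10% SDS) are assumptions chosen for illustration and are not stated in the source, and the function names are hypothetical.

```python
def stock_volume_ml(stock_conc: float, final_conc: float, final_volume_ml: float) -> float:
    """Volume of stock needed, from C1 * V1 = C2 * V2."""
    if final_conc > stock_conc:
        raise ValueError("final concentration exceeds stock concentration")
    return final_conc * final_volume_ml / stock_conc


def wash_recipe(ssc_x: float, sds_percent: float, final_volume_ml: float,
                ssc_stock_x: float = 20.0, sds_stock_percent: float = 10.0) -> dict:
    """Volumes of SSC stock, SDS stock, and water for one solution.

    The default stock strengths are assumed values, not taken from the text.
    """
    ssc = stock_volume_ml(ssc_stock_x, ssc_x, final_volume_ml)
    sds = stock_volume_ml(sds_stock_percent, sds_percent, final_volume_ml)
    return {
        "ssc_stock_ml": ssc,
        "sds_stock_ml": sds,
        "water_ml": final_volume_ml - ssc - sds,
    }


# 500 ml of the post-hybridization wash (2 x SSC/0.1% SDS).
recipe = wash_recipe(ssc_x=2.0, sds_percent=0.1, final_volume_ml=500.0)
```

With the assumed stocks, 500 ml of 2 x SSC/0.1% SDS takes about 50 ml of SSC stock, 5 ml of SDS stock, and 445 ml of water.
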

[0139] Rearranged VDJ gene and VJ gene encoding H chain and L chain of mouse monoclonal antibody respectively
are identified by Southern blotting mentioned above. The region comprising the identified DNA fragment is fractionated
by sucrose density gradient centrifugation and inserted into a phage vector (for example, Charon 4A, Charon 28,
λEMBL3, λEMBL4, etc.). E. coli (for example, LE392, NM539, etc.) are transformed with the phage vector to generate
a genomic library. The genomic library is screened by plaque hybridization such as the Benton-Davis method (Science,
Vol.196, pp.180-182 (1977)) using appropriate probes (H chain J gene, L chain (κ) J gene, etc.) to obtain positive
clones comprising rearranged VDJ gene or VJ gene respectively. By making the restriction map and determining the
nucleotide sequence of the clones obtained, it is confirmed that genes comprising the desired, rearranged VH (VDJ)
gene or VL (VJ) gene have been obtained.

[0140] Separately, human CH gene and human CL gene used for chimerization are isolated. For example, when a
chimeric antibody with human IgG1 is produced, the Cγ1 gene is isolated as the CH gene, and the Cκ gene is isolated as
the CL gene. These genes can be isolated from a human genomic library with mouse Cγ1 gene and mouse
Cκ gene, corresponding to human Cγ1 gene and human Cκ gene, respectively, as probes, taking advantage of the
high homology between the nucleotide sequences of mouse immunoglobulin gene and that of human immunoglobulin
gene.

[0141] Specifically, DNA fragments comprising human Cκ gene and an enhancer region are isolated from human λ
Charon 4A HaeIII-AluI genomic library (Cell, Vol.15, pp.1157-1174 (1978)), for example, using a 3 kb HindIII-BamHI
fragment from clone Ig146 (Proc. Natl. Acad. Sci. USA, Vol.75, pp.4709-4713 (1978)) and a 6.8 kb EcoRI fragment
from clone MEP10 (Proc. Natl. Acad. Sci. USA, Vol.78, pp.474-478 (1981)) as probes. In addition, for example, after
human fetal hepatocyte DNA is digested with HindIII and fractionated by agarose gel electrophoresis, a 5.9 kb fragment
is inserted into λ788 and then human Cγ1 gene is isolated with the probes mentioned above.

[0142] Using mouse VH gene, mouse VL gene, human CH gene, and human CL gene so obtained, and taking promoter
region and enhancer region into consideration, human CH gene is inserted downstream of mouse VH gene and human
CL gene is inserted downstream of mouse VL gene in an expression vector such as pSV2gpt or pSV2neo with
appropriate restriction enzymes and DNA ligase following the usual method. In this case, chimeric genes of mouse VH gene/
human CH gene and mouse VL gene/human CL gene can be respectively inserted into the same or different expression
vectors.

[0143] Chimeric gene-inserted expression vector(s) thus prepared are introduced into myeloma cells (e.g., P3X63
Ag8 653 cells or SP210 cells) that do not produce antibodies by the protoplast fusion method, DEAE-dextran method,
calcium phosphate method, or electroporation method. The transformants are screened by cultivating in a medium
containing a drug corresponding to the drug resistance gene inserted into the expression vector and, then, cells
producing desired chimeric monoclonal antibodies are obtained.

[0144] Desired chimeric monoclonal antibodies are obtained from the culture supernatant of antibody-producing cells
thus screened.

[0145] The "humanized antibody (CDR-grafted antibody)" of the present invention is a monoclonal antibody prepared
by genetic engineering and specifically means a humanized monoclonal antibody wherein a portion or the whole of
the complementarity determining regions of the hyper-variable region are derived from those of the hyper-variable
region from mouse monoclonal antibody, the framework regions of the variable region are derived from those of the
variable region from human immunoglobulin, and the constant region is derived from that from human immunoglobulin.
[0146] The complementarity determining regions of the hyper-variable region exist in the hyper-variable region in
the variable region of an antibody and mean three regions which directly bind, in a complementary manner, to an
antigen (complementarity-determining residues, CDR1, CDR2, and CDR3). The framework regions of the variable
region mean four comparatively conserved regions intervening upstream, downstream, or between the three
complementarity-determining regions (framework region, FR1, FR2, FR3, and FR4).

[0147] In other words, a humanized monoclonal antibody means that in which the whole region except a portion or
the whole region of the complementarity determining regions of the hyper-variable region of a mouse monoclonal
antibody has been replaced with their corresponding regions derived from human immunoglobulin.
[0148] The constant region derived from human immunoglobulin has the amino acid sequence inherent in each
isotype such as IgG, IgM, IgA, IgD, and IgE. The constant region of a humanized monoclonal antibody in the present
invention can be that from human immunoglobulin belonging to any isotype. Preferably, it is the constant region of
human IgG. The framework regions of the constant region derived from human immunoglobulin are not particularly
limited.

[0149] The humanized monoclonal antibody of the present invention can be produced, for example, as follows.
Needless to say, the production method is not limited thereto.

[0150] For example, a recombinant humanized monoclonal antibody derived from mouse monoclonal antibody can
be prepared by genetic engineering, referring to Published Japanese Translations of PCT International Publication No.
Hei 4-506458 and Unexamined Published Japanese Patent Application (JP-A) No. Sho 62-296890. Namely, at least
one mouse H chain CDR gene and at least one mouse L chain CDR gene corresponding to the mouse H chain CDR
gene are isolated from hybridomas producing mouse monoclonal antibody, and human H chain gene encoding the
whole region except human H chain CDR corresponding to mouse H chain CDR mentioned above and human L chain
gene encoding the whole region except human L chain CDR corresponding to mouse L chain CDR mentioned above
are isolated from human immunoglobulin genes.

[0151] The mouse H chain CDR gene(s) and the human H chain gene(s) so isolated are inserted, in an expressible
manner, into an appropriate vector so that they can be expressed. Similarly, the mouse L chain CDR gene(s) and the
human L chain gene(s) are inserted, in an expressible manner, into another appropriate vector so that they can be
expressed. Alternatively, the mouse H chain CDR gene(s)/human H chain gene(s) and mouse L chain CDR gene(s)/
human L chain gene(s) can be inserted, in an expressible manner, into the same expression vector so that they can
be expressed. Host cells are transformed with the expression vector thus prepared to obtain transformants producing
humanized monoclonal antibody. By cultivating the transformants, desired humanized monoclonal antibody is obtained
from culture supernatant.

[0152] The "human antibody" used in the present invention is immunoglobulin in which the entire regions comprising
the variable and constant region of H chain, and the variable and constant region of L chain constituting immunoglobulin
are derived from the genes encoding human immunoglobulin.
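
The definitions of the chimeric antibody, the humanized (CDR-grafted) antibody, and the human antibody differ only in which regions are of human origin. The Python sketch below summarizes this as a lookup table. It is a simplification for illustration: the region labels and names are hypothetical, and the humanized entry shows the case in which the whole of the complementarity determining regions is mouse-derived, whereas the definition above also allows only a portion.

```python
# Regions of an antibody chain, ordered from antigen-binding site outward.
REGIONS = ("CDR", "framework", "constant")

# Species of origin per region for each antibody type discussed in the text.
ANTIBODY_TYPES = {
    "mouse monoclonal": {"CDR": "mouse", "framework": "mouse", "constant": "mouse"},
    "chimeric": {"CDR": "mouse", "framework": "mouse", "constant": "human"},
    "humanized (CDR-grafted)": {"CDR": "mouse", "framework": "human", "constant": "human"},
    "human": {"CDR": "human", "framework": "human", "constant": "human"},
}


def human_regions(kind: str) -> list:
    """List the regions of the given antibody type that are human-derived."""
    origin = ANTIBODY_TYPES[kind]
    return [region for region in REGIONS if origin[region] == "human"]
```

In this table the chimeric antibody keeps the whole mouse variable region (CDRs and framework regions), while the humanized antibody retains only the mouse CDRs.
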
[0153] The human antibody can be produced in the same way as the production method of polyclonal or monoclonal
antibodies mentioned above by immunizing, with an antigen, a transgenic animal in which, for example, at least human
immunoglobulin gene(s) have been integrated into the locus of a non-human mammal such as a mouse by the usual
method.

[0154] For example, a transgenic mouse producing human antibodies is prepared by the methods described in
already published literature (Nature Genetics, Vol.7, pp.13-21 (1994); Nature Genetics, Vol.15, pp.146-156 (1997);
JP-WA Hei 4-504365; WO94/25585; Nikkei Science, No.6, pp.40-50 (1995); WO94/25585; Nature, Vol.368,
pp.856-859 (1994); JP-WA No. Hei 6-500233).

[0155] The "portion of an antibody" used in the present invention means a partial region of the antibody, and preferably
the monoclonal antibody of the present invention as mentioned above, and specifically, means F(ab')2, Fab', Fab, Fv
(variable fragment of antibody), sFv, dsFv (disulfide stabilized Fv), or dAb (single domain antibody) (Exp. Opin. Ther.
Patents, Vol.6, No.5, pp.441-456 (1996)).

[0156] "F(ab')2" and "Fab'" can be produced by treating immunoglobulin (monoclonal antibody) with a protease such
as pepsin and papain, and mean antibody fragments generated by digesting immunoglobulin near the disulfide
bonds existing between the hinge regions in each of the two H chains. For example, papain cleaves IgG upstream of
the disulfide bonds existing between the hinge regions in each of the two H chains to generate two homologous antibody
fragments in which an L chain composed of VL (L chain variable region) and CL (L chain constant region), and an H
chain fragment composed of VH (H chain variable region) and CHγ1 (γ1 region in the constant region of H chain) are
connected at their C terminal regions through a disulfide bond. Each of these two homologous antibody fragments is
called Fab'. Pepsin also cleaves IgG downstream of the disulfide bonds existing between the hinge regions in each of
the two H chains to generate an antibody fragment slightly larger than the fragment in which the two above-mentioned
Fab' are connected at the hinge region. This antibody fragment is called F(ab')2.

[0157] The "cell producing a monoclonal antibody reactive to a protein or a fragment thereof" of the present invention
means any cell producing the above-described monoclonal antibody of the present invention.
[0158] More specifically, the following is included:

(1) B cells that are obtained by immunizing the non-human mammals with the above-mentioned protein of the
present invention, a fragment thereof, or the cells producing the protein and that produce a monoclonal antibody
reactive to the protein of the present invention or a fragment thereof.

(2) The above-mentioned hybridomas (fused cells) prepared by fusing the thus-obtained B cells producing the
antibody with myeloma cells derived from mammals.

(3) Monoclonal antibody-producing transformants obtained by transforming cells other than the monoclonal
antibody-producing B cells and hybridomas with genes encoding the monoclonal antibody isolated from the monoclonal
antibody-producing B cells or hybridomas (either the heavy chain-encoding gene or the light chain-encoding gene,
or both).

[0159] The monoclonal antibody-producing transformants of (3) mean recombinant cells producing a recombinant
monoclonal antibody produced by B cells of (1) or hybridomas of (2). These antibody-producing transformants can be
produced by the method as used for producing the above-described chimeric monoclonal antibody and humanized
monoclonal antibody.

[0160] The "pharmaceutical composition" used herein means a pharmaceutical composition comprising any of
the protein, fragment thereof, antibody, or portion thereof defined hereinabove, and a pharmaceutically acceptable
carrier.

[0161] The "pharmaceutically acceptable carrier" includes an excipient, a diluent, an expander, a disintegrating agent,
a stabilizer, a preservative, a buffer, an emulsifier, an aromatic, a colorant, a sweetener, a viscosity-increasing agent,
a flavor, a dissolving agent, or other additives. Using one or more of such carriers, a pharmaceutical composition can
be formulated into tablets, pills, powders, granules, injections, solutions, capsules, troches, elixirs, suspensions,
emulsions, or syrups. The pharmaceutical composition can be administered orally or parenterally. Other forms for parenteral
administration include a solution for external application, suppository for rectal administration, and pessary, prescribed
by the usual method, which comprises one or more active ingredients.

[0162] The dosage can vary depending on the age, sex, weight, and symptoms of a patient, effect of treatment,
administration route, period of treatment, or the kind of active ingredient (protein or antibody mentioned above)
contained in the pharmaceutical composition. Usually, the pharmaceutical composition can be administered to an adult in
a dose of 10 μg to 1000 mg (or 10 μg to 500 mg) per one administration. Depending on various conditions, a lower
dosage may be sufficient in some cases, and a higher dosage may be necessary in other cases.
[0163] In particular, the injection can be produced by dissolving or suspending the antibody in a non-toxic,
pharmaceutically acceptable carrier such as physiological saline or commercially available distilled water for injections by
adjusting the concentration to 0.1 μg antibody/ml carrier to 10 mg antibody/ml carrier. The injection thus produced can
be administered to a human patient in need of treatment in a dose of 1 μg to 100 mg/kg body weight, preferably 50 μg
to 50 mg/kg body weight, once or more times a day. Examples of administration routes are medically appropriate
administration routes such as intravenous injection, subcutaneous injection, intradermal injection, intramuscular
injection, or intraperitoneal injection, preferably intravenous injection.
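
The per-kilogram dose range and the concentration range stated above can be combined into a simple arithmetic check. The Python sketch below is illustrative only: it reads the range as 1 μg/kg to 100 mg/kg body weight and the concentration as 0.1 μg/ml to 10 mg/ml, and the function names and the example body weight are hypothetical. It is not dosing guidance.

```python
UG_PER_MG = 1000.0

# Concentration window for the injection, in micrograms of antibody per ml.
MIN_CONC_UG_PER_ML = 0.1
MAX_CONC_UG_PER_ML = 10 * UG_PER_MG


def dose_range_ug(body_weight_kg: float,
                  low_ug_per_kg: float = 1.0,
                  high_ug_per_kg: float = 100 * UG_PER_MG) -> tuple:
    """Lower and upper dose in micrograms for a given body weight."""
    return (low_ug_per_kg * body_weight_kg, high_ug_per_kg * body_weight_kg)


def injection_volume_ml(dose_ug: float, concentration_ug_per_ml: float) -> float:
    """Volume needed to deliver the dose at the given concentration."""
    if not MIN_CONC_UG_PER_ML <= concentration_ug_per_ml <= MAX_CONC_UG_PER_ML:
        raise ValueError("concentration outside the stated 0.1 ug/ml to 10 mg/ml window")
    return dose_ug / concentration_ug_per_ml


# Example: a 60 kg adult (hypothetical weight) and a 1 mg/ml preparation.
low_ug, high_ug = dose_range_ug(60.0)
volume_ml = injection_volume_ml(dose_ug=3000.0, concentration_ug_per_ml=1000.0)
```

For a body weight of 60 kg the stated range spans 60 μg to 6,000,000 μg (6 g) per administration, and a 3 mg dose at 1 mg/ml corresponds to 3 ml. Passing 50 and 50,000 as the per-kilogram bounds gives the preferred range instead.
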
[0164] The injection can also be prepared into a non-aqueous diluent (for example, propylene glycol, polyethylene
glycol, vegetable oil such as olive oil, and alcohols such as ethanol), suspension, or emulsion.

[0165] The injection can be sterilized by filtration with a bacteria-non-penetrable filter, by mixing bactericide, or by
irradiation. The injection can be prepared at the time of use. Namely, it is freeze-dried to make a sterile solid composition,
and can be dissolved in sterile distilled water for injection or another solvent before use.

[0166] The pharmaceutical composition of the present invention is useful as a drug for preventing and treating, for
example, primary immunodeficiency syndrome with congenital disorder of immune system, mainly immunodeficiency
considered to develop by B lymphocyte deficiency, decrease, or dysfunction (e.g., sex-linked agammaglobulinemia,
sex-linked agammaglobulinemia with growth hormone deficiency, immunoglobulin deficiency with high IgM level,
selective IgM deficiency, selective IgE deficiency, immunoglobulin heavy chain gene deletion, κ chain deficiency, IgA
deficiency, IgG subclass selective deficiency, CVID (common variable immunodeficiency), infantile transient
dysgammaglobulinemia, Rosen syndrome, severe combined immunodeficiency (sex-linked, autosomal recessive), ADA
(adenosine deaminase) deficiency, PNP (purine nucleoside phosphorylase) deficiency, MHC class II deficiency, reticular
dysplasia, Wiskott-Aldrich syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal aberration, familial Ig
hypermetabolism, hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, Good syndrome, osteodystrophy,
transcobalamin syndrome, secretory bead syndrome, etc.), various diseases with antibody production deficiency that are
secondary immunodeficiency syndrome with disorder of immune system caused by an acquired etiology (for example,
AIDS, etc.), and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis, conjunctivitis, allergic rhinitis,
allergic enteritis, drug-induced allergy, food allergy, allergic urticaria, glomerulonephritis, etc.), and for relieving conditions
due to various immunodeficiencies associated with the diseases.

[0167] The DNA of the present invention described above also includes "DNA comprising any partial nucleotide sequence
of SEQ ID NO: 7, from SEQ ID NO: 9 to SEQ ID NO: 15, or SEQ ID NO: 35, those with partial chemical modification,
DNA comprising complementary nucleotide sequences to the partial sequence, or those with partial chemical
modification".

[0168] Here, the "partial nucleotide sequence" means a partial nucleotide sequence comprising any number of
bases at any region included in any nucleotide sequence listed in SEQ ID NO: 7, SEQ ID NO: 9 to SEQ ID NO:
15, or SEQ ID NO: 35.

[0169] The DNA is useful as a probe in DNA hybridization or RNA hybridization procedures. For use as a probe,
a continuous nucleotide sequence of over 20 bases, preferably over 50 bases, more preferably over 100 bases,
much more preferably over 200 bases, and especially preferably over 300 bases, can be listed as the partial
nucleotide sequence.

[0170] The DNA described above is also useful as primers for PCR. For use as PCR primers, a continuous partial
nucleotide sequence of from 5 to 100 bases, preferably from 5 to 70 bases, more preferably from 5 to 50 bases,
and much more preferably from 5 to 30 bases, can be listed as the partial nucleotide sequence.
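
The length criteria for partial nucleotide sequences used as primers can be illustrated with a minimal sketch. The function names and the short reference sequence below are hypothetical and are not taken from the SEQ ID listings; the sketch only assumes that a primer candidate is a contiguous partial sequence of the reference, or a sequence complementary to such a partial sequence, with a length inside the stated range.

```python
# Hypothetical helpers; REFERENCE is an invented toy sequence, not SEQ ID NO: 7.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def partial_sequences(seq, min_len, max_len):
    """Yield every contiguous partial sequence with min_len <= length <= max_len."""
    n = len(seq)
    for length in range(min_len, min(max_len, n) + 1):
        for start in range(n - length + 1):
            yield seq[start:start + length]


def is_primer_candidate(oligo, reference, min_len=5, max_len=100):
    """True if oligo has an allowed length and matches the reference strand
    or is complementary to a contiguous part of it."""
    if not (min_len <= len(oligo) <= max_len):
        return False
    return oligo in reference or reverse_complement(oligo) in reference


REFERENCE = "ATGGACAGCCTTCTGATG"  # toy example only

print(is_primer_candidate("GACAG", REFERENCE))  # sense-strand partial sequence
print(is_primer_candidate("CTGTC", REFERENCE))  # complementary to GACAG
print(is_primer_candidate("GAC", REFERENCE))    # shorter than 5 bases
```

The same check applies to probes or antisense sequences by changing `min_len` and `max_len` to the ranges stated for those uses.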

[0171] Moreover, the DNA described above is useful as an antisense drug. The DNA, by hybridizing to a DNA or an RNA
encoding the AID protein of the present invention, can inhibit transcription of the DNA into mRNA or translation of the
mRNA into the protein.

[0172] For use as an antisense drug, the partial nucleotide sequence consists of 5 to 100 consecutive nucleotides,
preferably 5 to 70 consecutive nucleotides, more preferably 5 to 50 consecutive nucleotides, and still more
preferably 5 to 30 consecutive nucleotides.

[0173] When the DNA is used as an antisense DNA pharmaceutical, the DNA sequence can be chemically modified
in part to extend the half-life (stability) of the blood concentration of the DNA administered to patients, to increase
the intracellular-membrane permeability of the DNA, or to increase the degradation resistance or the absorption of
the orally administered DNA in the digestive organs. The chemical modification includes, for example, modification
of the phosphate bonds, the riboses, the nucleotide bases, the sugar moiety, and the 3' end and/or the 5' end in the
structure of the oligonucleotide DNA.

[0174] The modification of the phosphate bond includes, for example, the conversion of one or more of the bonds to
phosphodiester bonds (D-oligo), phosphorothioate bonds, phosphorodithioate bonds (S-oligo), methyl phosphonate
(MP-oligo), phosphoroamidate bonds, non-phosphate bonds or methyl phosphonothioate bonds, or combinations
thereof. The modification of the ribose includes, for example, the conversion to 2'-fluororibose or 2'-O-methylribose.
The modification of the nucleotide base includes, for example, the conversion to 5-propynyluracil or 2-aminoadenine.
[0175] Another aspect of the present invention relates to "methods of identifying substances regulating the
production of the AID protein of the present invention or the transcription of the gene encoding the AID protein into mRNA".
The method of the present invention is, namely, "a method of screening for drugs capable of regulating functions of
the AID protein or the AID gene".

[0176] As the cells used in the method of the present invention, any cells can be used as long as they are capable of
producing the AID protein of the present invention. For instance, native cells (preferably of mouse or human), transgenic
cells transformed with a gene encoding the AID protein of the present invention, cells into which RNA encoding the AID
protein of the present invention has been introduced, etc., can be listed.

[0177] As the host cells used for preparing the transgenic cells, the various cells mentioned above in the detailed
explanation of the method of expressing the protein of the present invention using the DNA encoding the protein
can be used.

[0178] For instance, various cells such as naturally established cells or artificially established transgenic cells (e.g.
bacteria (Escherichia, Bacillus), yeast (Saccharomyces, Pichia), animal cells and insect cells) can be exemplified.
[0179] Preferably, animal cells, namely, cells derived from mouse (COP, L, C127, Sp2/0, NS-1, NIH3T3, etc.),
cells derived from rat (PC12, PC12h, etc.), cells derived from hamster (BHK, CHO, etc.), cells derived from monkey
(COS1, COS3, COS7, CV1, Vero, etc.), and cells derived from human (HeLa, cells derived from diploid fibroblasts,
HEK293 cells, myeloma cells, Namalwa, etc.) can be exemplified.

[0180] The "substance" in the present invention means any natural substance existing in nature and any substance
prepared artificially. The substances can be grouped into "peptidic substances" and "non-peptidic substances".
[0181] As the "non-peptidic substance", "DNA comprising a partial nucleotide sequence, or chemically modified DNA
derived from it" that is useful as an antisense drug as described above, "antisense RNA" with structural and
pharmacological properties similar to those of the antisense DNA, or any chemically synthesized "compounds" can be
exemplified. The "compounds" herein means compounds other than DNA, RNA, and peptidic substances. Namely,
compounds with a molecular weight of from 100 to approximately 1000, preferably from 100 to 800, and more
preferably from 100 to 600, can be exemplified.

[0182] As the "peptidic substance", the antibodies already described above in detail (preferably monoclonal antibodies,
more preferably recombinant antibodies or human monoclonal antibodies), oligopeptides, or chemically modified
substances derived from them can be exemplified. Examples of an oligopeptide are peptides comprising 5 to 30 amino
acids, preferably 5 to 20 amino acids. The chemical modification can be designed depending on various purposes, for
example, an increased half-life in blood in the case of in vivo administration, or increased resistance to
degradation or increased absorption in the digestive tract upon oral administration.

[0183] The methods described in (24) to (28) above include the so-called reporter gene assay as one of the methods
of the present invention.

[0184] As the "reporter protein", luciferase derived from firefly or sea pansy, or GFP derived from jellyfish, is
preferred.

[0185] As the "reporter gene assay", the methods described below are representative.

[0186] Transgenic cells are generated by transforming cells commonly used in the production of recombinant proteins
with an expression vector in which a gene encoding the target protein and a gene encoding the reporter protein are
inserted so that transcription of the gene encoding the reporter protein into mRNA occurs depending on the signal
for transcription of the gene of the target protein into mRNA. The test substances (described above) are
applied to the obtained transformant cells. Whether a compound affects the expression of the target protein
can be analyzed by indirectly measuring the level of the target protein through the amount of
fluorescence emitted by the reporter protein expressed in parallel with the target protein (for reference, see U.S. Pat.
No. 5,436,128 and U.S. Pat. No. 5,401,629).
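
As a minimal sketch of how reporter readings from such an assay might be compared, the following assumes replicate signal values for treated and untreated transformant cells and an arbitrary two-fold cut-off. The function names, the threshold, and all numbers are hypothetical illustrations and are not part of the described method.

```python
def reporter_fold_change(treated, control):
    """Ratio of the mean reporter signal in treated cells to that in control cells."""
    mean_treated = sum(treated) / len(treated)
    mean_control = sum(control) / len(control)
    return mean_treated / mean_control


def classify(fold, threshold=2.0):
    """Label a test substance by its effect on the reporter signal.

    The two-fold threshold is an assumed, illustrative cut-off.
    """
    if fold >= threshold:
        return "enhancer"
    if fold <= 1.0 / threshold:
        return "inhibitor"
    return "no effect"


# Invented example readings (arbitrary units)
fold = reporter_fold_change([200.0, 220.0], [100.0, 110.0])
print(fold, classify(fold))
```

In practice the cut-off and the number of replicates would be chosen for the particular reporter and cell system.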

[0187] The identification of compounds using the present assay can be accomplished by manual operation,
but it can also be done readily and simply by automation using so-called High-Throughput Screening with robots
(SOSHIKI BAIYO KOUGAKU, Vol.23, No.13, p.521-524; U.S. Pat. No. 5,670,113).

[0188] The "cells" and "substances" used in the methods described above have the same meanings as defined
above.

[0189] The substances identified by the methods of the present invention are very useful as drugs for the therapy of
various diseases considered to be caused by hyperfunction or deficiency of the AID protein of the present invention
or by deficiency or mutation of the AID gene, or for remission of various symptoms that supervene with the diseases.

Brief Description of the Drawings 

[0190] Figure 1 is a photograph showing the production state of DNA including an Sα sequence looped out with
class switch recombination in mouse B cell clone CH12F3-2 cultured under the various conditions.
[0191] Figure 1 (a) shows the electrophoretic state of DNA including an Sα sequence looped out with class switch
recombination, amplified by PCR using DNA derived from mouse B cell clone CH12F3-2 cultured under the various
conditions.

[0192] Lanes 1 and 6 show the electrophoretic state of marker DNAs. Lane 2 shows the electrophoretic state of the PCR
product obtained using, as a template, DNA from cells cultured in the absence of IL-4, CD40L, TGF-β, and cycloheximide.
Lane 3 shows the electrophoretic state of the DNA product obtained using, as a template, DNA from cells cultured in the
presence of cycloheximide only. Lane 4 shows the electrophoretic state of the PCR product obtained using, as a template,
DNA from cells cultured in the presence of IL-4, CD40L, and TGF-β. Lane 5 shows the electrophoretic state of the PCR
product obtained using, as a template, DNA from cells cultured in the presence of IL-4, CD40L, TGF-β, and cycloheximide.

[0193] Figure 1 (b) shows the result of Southern hybridization for DNA including an Sα sequence looped out with
class switch recombination, amplified by PCR using DNA derived from mouse B cell clone CH12F3-2 cultured under
the various conditions.

[0194] Lane 1 shows the result of hybridization against the PCR product obtained using, as a template, DNA from cells
cultured in the absence of IL-4, CD40L, TGF-β, and cycloheximide. Lane 2 shows the result of Southern hybridization
against the PCR product obtained using, as a template, DNA from cells cultured in the presence of cycloheximide only.
Lane 3 shows the result of hybridization against the PCR product obtained using, as a template, DNA from cells cultured
in the presence of IL-4, CD40L, and TGF-β. Lane 4 shows the result of hybridization against the PCR product obtained
using, as a template, DNA from cells cultured in the presence of IL-4, CD40L, TGF-β, and cycloheximide.
[0195] Figure 2 is a photograph showing the production state of DNA including an Sα sequence looped out with
class switch recombination, amplified by PCR using DNA derived from mouse B cell clone CH12F3-2 cultured under
various conditions.

[0196] Figure 2 (a) shows the electrophoretic state, visualized by ethidium bromide staining, of DNA including an
Sα sequence looped out with class switch recombination in mouse B cell clone CH12F3-2 cultured under the various
conditions.

[0197] Lanes 1 and 6 show the electrophoretic state of marker DNAs. Lane 2 shows the electrophoretic state of the PCR
product obtained using, as a template, DNA from cells cultured in the absence of IL-4, CD40L, TGF-β, and cycloheximide.
Lane 3 shows the electrophoretic state of the DNA product obtained using, as a template, DNA from cells cultured in the
presence of cycloheximide only. Lane 4 shows the electrophoretic state of the PCR product obtained using, as a template,
DNA from cells cultured in the presence of IL-4, CD40L, and TGF-β. Lane 5 shows the electrophoretic state of the PCR
product obtained using, as a template, DNA from cells cultured in the presence of IL-4, CD40L, TGF-β, and cycloheximide.

[0198] Figure 2 (b) shows the result of Southern hybridization for DNA including an Sα sequence looped out with
class switch recombination, amplified by PCR using DNA derived from mouse B cell clone CH12F3-2 cultured under
the various conditions.

[0199] Lane 1 shows the result of hybridization against the PCR product obtained using, as a template, DNA from cells
cultured in the absence of IL-4, CD40L, TGF-β, and cycloheximide. Lane 2 shows the result of hybridization against the
PCR product obtained using, as a template, DNA from cells cultured in the presence of cycloheximide only.
Lane 3 shows the result of hybridization against the PCR product obtained using, as a template, DNA from cells cultured
in the presence of IL-4, CD40L, and TGF-β. Lane 4 shows the result of hybridization against the PCR product obtained
using, as a template, DNA from cells cultured in the presence of IL-4, CD40L, TGF-β, and cycloheximide.
[0200] Figure 3 shows the result of Northern blotting, using a radiolabeled cDNA fragment coding 23C9 (AID)
protein as a probe, against mRNA derived from mouse B cell clone CH12F3-2 cultured under the various conditions.

[0201] Lane 1 shows the result of blotting against mRNA from cells cultured in the absence of IL-4, CD40L, TGF-β,
and cycloheximide. Lane 2 shows the result of blotting against mRNA from cells cultured in the presence of
cycloheximide only. Lane 3 shows the result of blotting against mRNA from cells cultured in the presence
of IL-4, CD40L, and TGF-β. Lane 4 shows the result of blotting against mRNA from cells cultured in the presence of
IL-4, CD40L, TGF-β, and cycloheximide.

[0202] Figure 4 shows the result of Northern blotting, using a radiolabeled cDNA fragment coding 23C9 (AID) protein
as a probe, against mRNA derived from mouse B cell clone CH12F3-2 cultured under the various conditions.
[0203] Lane 1 shows the result of blotting against mRNA from cells cultured in the absence of IL-4, CD40L,
TGF-β, and cycloheximide. Lane 2 shows the result of blotting against mRNA from cells cultured in the presence of
cycloheximide only. Lane 3 shows the result of blotting against mRNA from cells cultured in the presence of IL-4,
CD40L, and TGF-β. Lane 4 shows the result of blotting against mRNA from cells cultured in the presence of IL-4,
CD40L, TGF-β, and cycloheximide.

[0204] Figure 5 shows the homology between the amino acid sequence of mouse AID protein and that of mouse
APOBEC-1.

[0205] An amino acid in a closed box shows an identical amino acid. A region in an open box indicates a cytidine
deaminase motif. An amino acid with an asterisk (*) or an arrow indicates an amino acid conserved among APOBEC-1
proteins derived from rat, mouse, rabbit, and human.

[0206] Figure 6 shows a phylogenetic tree of various enzymes belonging to the cytosine nucleoside/nucleotide
deaminase family, prepared based on the cytidine deaminase motif.

[0207] Figure 7 is a photograph indicating the electrophoretic state of AID-GST fusion protein in the molecular
weight analysis by gel electrophoresis and the silver staining method.

[0208] Lane 1 shows the electrophoretic state of a marker molecule. Lane 2 shows the electrophoretic state of
various proteins included in extracts from wild type Escherichia coli DH5α. Lane 3 shows the electrophoretic state of
purified AID-GST fusion protein.

[0209] Figure 8 shows the electrophoretic state of AID-GST fusion protein by Western blotting using anti-AID protein
peptide antibody.

[0210] Lane 1 shows the electrophoretic state of various proteins included in the extract from wild type E. coli DH5α.
[0211] Lane 2 shows the electrophoretic state of purified AID-GST protein.

[0212] Figure 9 shows the cytidine deaminase activity depending on the concentration of AID protein.
[0213] Figure 10 shows the inhibitory effect of tetrahydrouridine, which is an inhibitor specific to cytidine deaminase,
on the cytidine deaminase activity of AID protein.

[0214] Figure 11 shows the inhibitory effect of each of 1,10-o-phenanthroline, which is a zinc-chelating agent, and
1,7-o-phenanthroline, which is an inactive isomer thereof, on the cytidine deaminase activity of AID protein.
[0215] Figure 12 is a photograph indicating the expression state of AID mRNA in various tissues in mouse, analyzed
by the Northern blotting method.

[0216] Figure 13 is a photograph indicating the expression state of AID mRNA in various lymphatic tissues in
mouse, analyzed by the RT-PCR method.

[0217] Figure 14 is a photograph showing the expression state of AID mRNA over time in activated mouse B
cell clone CH12F3-2, analyzed by the Northern blotting method.
[0218] Figure 15 is a photograph showing the expression state of AID mRNA in mouse B cell clone CH12F3-2 stimulated
with cytokines in various combinations, analyzed by Northern blotting.

[0219] Figure 16 is a photograph indicating the expression state of AID mRNA in mouse spleen B cells stimulated
with stimulants in various combinations, analyzed by the Northern hybridization method.
[0220] Figure 17 is a photograph indicating the expression state of AID mRNA in splenocytes derived from mice
immunized with sheep red blood cells, analyzed by Northern blotting analysis.

[0221] Figure 18 shows the expression state of AID mRNA in splenocytes derived from mice immunized with sheep
red blood cells, analyzed by RT-PCR.

[0222] Figure 19 is a photograph indicating the localization of expression of AID mRNA in splenocytes derived from a
normal mouse or a mouse immunized with sheep red blood cells, analyzed by in situ hybridization.

[0223] Figure 19 (A) and (D) indicate the results of hybridization using a sense AID probe. Figure 19 (B) and (E)
show the localization of AID mRNA expression in hybridization using an antisense AID probe. Figure 19 (C) and (F) show
the localization of the germinal center in the staining test with FITC-labeled PNA. Figure 19 (A), (B), and (C) indicate
the results of the test using spleen tissues derived from a normal mouse (before immunization with sheep red blood cells).
Figure 19 (D), (E), and (F) show the results of the examination using spleen tissue slices prepared 5 days after immunizing
a mouse with sheep red blood cells.

[0224] Figure 20 is a photograph showing the localization of expression of AID mRNA in spleen tissue and Peyer's
patch tissue derived from a normal mouse or from a mouse immunized with sheep red blood cells,
analyzed by in situ hybridization.

[0225] Figure 20 (A), (D), and (G) show the results of hybridization using a sense AID probe. Figure 20 (B), (E),
and (H) show the localization of the expression of AID mRNA in hybridization using an antisense AID probe. Figure
20 (C), (F), and (I) show the localization of the germinal center in the staining test with FITC-labeled PNA. Figure 20 (A),
(B), and (C) show the results of the examination using spleen tissues derived from a normal mouse (before immunization
with sheep red blood cells). Figure 20 (D), (E), and (F) show the results of the examination using spleen tissue slices
prepared 5 days after immunization of a mouse with sheep red blood cells. Figure 20 (G), (H), and (I) show the results
of the test using Peyer's patch prepared 5 days after immunization of a mouse with sheep red blood cells.
[0226] Figure 21 schematically shows the relative locations of partial nucleotide sequences of human genomic DNA
coding human AID protein, which were amplified by PCR using various pairs of primers.

[0227] Figure 22 schematically shows the degree of homology between the amino acid sequence of mouse AID protein
and that of human AID protein. The part in a closed box is the cytidine and deoxycytidylate deaminase zinc-binding
region, which is the active region of AID protein.

[0228] Figure 23 schematically shows the structure of human genomic DNA including the gene coding human AID
protein. The numbers 1 to 5 show exon 1, exon 2, exon 3, exon 4, and exon 5, respectively.

[0229] Figure 24 is a photograph indicating the expression state of human AID mRNA in various types of human
tissues, analyzed by RT-PCR.

[0230] Figure 25 is a photograph indicating the location (localization) of the human AID gene on a human chromosome,
analyzed by the fluorescence in situ hybridization (FISH) method.

[0231] The two points at the tips of the arrows show 12p13, where the human AID gene exists.

Best Mode for Carrying out the Invention

[0232] The invention is illustrated in detail by the following Examples, but is not restricted to the embodiments
described in the Examples.

Example 1: Culture of mouse B cell clone CH12F3-2 and confirmation of properties

[0233] Mouse B cell clone CH12F3-2, which undergoes class switch recombination (CSR) from IgM to IgA several hours
after stimulation with IL-4, TGF-β, and CD40L and was previously isolated by the present inventors, was cultured in the
same manner as in the previous reports (Immunity, Vol. 9, p. 1-10, 1998; Curr. Biol., Vol. 8, No. 4, p. 227-230, 1998;
Int. Immunol., Vol. 8, No. 2, p. 193-201, 1996).

[0234] When CH12F3-2 is stimulated with IL-4, TGF-β, and CD40L, a circular DNA including an S region (switch
region) looped out by class switch recombination is detected several hours after the stimulation.
[0235] The following manipulation was conducted according to the previous report (Curr. Biol., Vol. 8, No. 4, p.
227-230, 1998).

[0236] B cell clone CH12F3-2 stimulated with IL-4, TGF-β, and CD40L, and the same clone without stimulation, were each
cultured for 6 hours in the presence or absence of cycloheximide (200 ng/ml), which is a protein synthesis inhibitor.
Genomic DNA was extracted from each cell sample, and PCR was conducted with the DNA as a template by following the
standard method to amplify circular DNA including an Sμ sequence and an Sα sequence. PCR was conducted by
using a pair of primers, αF1 and μR3, and the other PCR was conducted by using a pair of primers, αF1 and μR3.
[0237] As a control, genomic DNA coding glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was amplified by
PCR.

[0238] The PCR products were subjected to gel electrophoresis with ethidium bromide staining. Figure 1 (a) and Figure
2 (a) show the results.

[0239] To confirm the presence or absence of amplification of the circular DNA including the looped-out S region,
Southern hybridization was conducted against the PCR product by using the mouse Sα region gene as a hybridization
probe, according to the standard method (L. Sambrook, E. F., Tom Maniatis., Second edition, Ed. Molecular Cloning
(Nolan, C., Ed.), Cold Spring Harbor, 1989). As the Sα gene, a 1,155 bp DNA fragment obtained by digesting the 10 kb
EcoRI cleaved fragment IgH703 with Hind III and EarI was used (Genbank #D11468, DNA No. 1993-3148) (J. Biol.
Chem., Vol. 268, p. 4651-4665). Figures 1 (b) and 2 (b) show the results.

[0240] It was shown that mouse B cell clone CH12F3-2 produces the looped-out DNA containing the Sα sequence through
class switch recombination upon stimulation with cytokines, and that the production of the DNA is inhibited by the
presence of cycloheximide. This result suggested that the occurrence of class switch recombination of an immunoglobulin
gene requires de novo synthesis of a protein in the very early stage after the stimulation and that the protein is deeply
involved in the induction of the class switch.

Example 2: Identification of a gene whose expression is enhanced in mouse B cell clone CH12F3-2 stimulated by cytokines

[0241] An attempt was made to isolate, from the CH12F3-2 cells, a gene that is presumably expressed in the early stage
after mouse B cell clone CH12F3-2 is stimulated and presumably plays a role in inducing class switch recombination of
an immunoglobulin gene, by suppression subtractive hybridization (SSH) (Proc. Natl. Acad. Sci. USA, Vol. 93,
p. 6025-6030, 1996; Anal. Biochem., Vol. 240, p. 90-97, 1996) using the inhibitory PCR effect (Nucleic Acids Res., Vol.
23, p. 1087-1088, 1995).

[0242] A cDNA library necessary for subtraction cloning was prepared by using the PCR-Select Subtraction Kit
(CLONTECH, Catalogue No. K1804-1), following the instruction manual supplied with the kit for the experimental
manipulation.

[0243] PolyA+ RNA was isolated from each of mouse B cell clone CH12F3-2 stimulated with IL-4, TGF-β and CD40L
for 5 hours, the same cells stimulated with the cytokines for 12 hours, and the cells which were not stimulated, by
following the reported method (Nucleic Acids Res., Vol. 26, No. 4, p. 911-918, 1998), and treated with DNase I to
eliminate any contaminating genomic DNA. Then cDNA was prepared from each polyA+ RNA sample using reverse
transcriptase according to the standard method. The cDNAs prepared from mouse B cell clone CH12F3-2 treated with
the above cytokines for 5 or 12 hours were mixed in equimolar amounts and used as a tester cDNA. On the other
hand, cDNA derived from unstimulated cells was used as a driver cDNA.

[0244] Subtraction was conducted by adding the driver cDNA to the tester cDNA according to the above previous
report and the experimental manipulation manual. The efficiency of subtraction was monitored by adding, as a control,
a small amount (1:1000 mole ratio) of φX174 phage DNA cleaved with the restriction enzyme Hae III to the tester
cDNA. After the subtraction, the phage DNA was concentrated to a mole ratio of about 100 times.
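
The arithmetic behind this control can be written out as a short sketch. It assumes that "about 100 times" means an approximately 100-fold enrichment of the spiked phage DNA relative to its starting mole ratio; the function name is hypothetical.

```python
from fractions import Fraction


def enriched_ratio(initial_ratio, fold_enrichment):
    """Mole ratio of the spiked control after subtraction, given its starting
    ratio and the fold enrichment (assumed interpretation of 'about 100 times')."""
    return initial_ratio * fold_enrichment


initial = Fraction(1, 1000)           # control DNA spiked at 1:1000
after = enriched_ratio(initial, 100)  # roughly 100-fold enrichment
print(after)                          # resulting mole ratio of the control
```

Under this reading, the control would rise from 1 part in 1000 to roughly 1 part in 10 of the subtracted cDNA.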
[0245] The subtracted cDNA was cloned into T-vector (Promega) according to the standard method to prepare a plasmid
library. In the same manner as in the previous reports, 2000 colonies in the library were screened by the differential
hybridization method (Nucleic Acids Res., Vol. 26, No. 4, p. 911-918, 1998; Medical immunity, Vol. 29, No. Suppl. 17,
p. 451-459, 1997). Each of the above tester cDNA and driver cDNA was radiolabeled and used for hybridization.
Clones including φX174 phage DNA were selected by hybridizing φX174 phage DNA with a replica filter.
[0246] One hundred fifteen clones giving a stronger signal with the radio-labeled tester cDNA probe than with the
radio-labeled driver cDNA probe were identified, and the nucleotide sequence of each clone was determined by using a DNA
sequencer.

[0247] Northern blotting was conducted against mRNA obtained from mouse B cell clone CH12F3-2 stimulated with
IL-4, TGF-β and CD40L or from the same cells unstimulated, using the radio-labeled DNA inserted into each clone as
a probe, according to the standard method (L. Sambrook, E. F., Tom Maniatis., Second edition, Ed. Molecular Cloning
(Nolan, C., Ed.), Cold Spring Harbor, 1989). As a result, enhanced expression corresponding to the stimulation
with the above cytokines was observed in 23 of the 115 clones. Gene fragments coding 7 different types of proteins,
including genes coding 3 kinds of known proteins and 4 kinds of novel proteins, were found to be inserted into the
23 clones. Specifically, the expression of the 7 kinds of genes was found to be enhanced in mouse B cell clone
CH12F3-2 by the stimulation with IL-4, TGF-β and CD40L.

<The known proteins>

[0248]

ABCD-1/MDC (8 clones)
IFNγ receptor (2 clones)
I-α (MHC class II) (1 clone)

<Novel proteins>

[0249]

23C9 (3 clones)
15B11 (7 clones)
8B9 (1 clone)
18A9 (1 clone)
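
The clone counts listed above can be checked against the totals reported for the screen (23 positive clones representing 7 genes). The following sketch simply tallies the listed numbers; the dictionary names are illustrative.

```python
# Clone counts as listed for the known and novel proteins
known = {"ABCD-1/MDC": 8, "IFN-gamma receptor": 2, "I-alpha (MHC class II)": 1}
novel = {"23C9": 3, "15B11": 7, "8B9": 1, "18A9": 1}

total_clones = sum(known.values()) + sum(novel.values())
total_genes = len(known) + len(novel)

print(total_clones, total_genes)
```

The tally is consistent with 3 known and 4 novel genes spread over the 23 clones showing enhanced expression.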

[0250] As it has been known that the expression of the above I-α gene and ABCD-1/MDC gene is enhanced by
stimulating mouse spleen B cells with IL-4 and CD40L, it was confirmed that the subtraction cloning was effectively
conducted (J. Exp. Med., Vol. 188, No. 3, p. 451-463, 1998; Immunity, Vol. 5, No. 4, p. 319-330, 1996).

Example 3: Expression of mRNA for a novel protein 23C9 in mouse B cell clone CH12F3-2 

[0251] The degree of enhanced expression of the gene coding the novel protein 23C9 in mouse B cell clone CH12F3-2
stimulated with IL-4, TGF-β and CD40L was analyzed by Northern blotting according to the standard method
(L. Sambrook, E. F., Tom Maniatis., Second edition, Ed. Molecular Cloning (Nolan, C., Ed.), Cold Spring Harbor, 1989).
[0252] Mouse B cell clone CH12F3-2 was cultured in the presence of one of the following reagents for 12 hours.

(1) IL-4, TGF-β and CD40L only
(2) Cycloheximide, which is a protein synthesis inhibitor (200 ng/ml), only
(3) IL-4, TGF-β and CD40L as well as cycloheximide (200 ng/ml)

[0253] Northern blotting was conducted against mRNA (10 μg for each group) obtained in the same manner as in the
previous report (Nucleic Acids Res., Vol. 26, No. 4, p. 911-918, 1998) from each group of treated cells, using the
radio-labeled cDNA fragment (1,020 bp) coding the novel protein 23C9 obtained in the above Example, according to the
standard method (L. Sambrook, E. F., Tom Maniatis., Second edition, Ed. Molecular Cloning (Nolan, C., Ed.), Cold Spring
Harbor, 1989).

[0254] As a control examination, Northern blotting was conducted for mRNA derived from B cell clone CH12F3-2
cultured without any of the above cytokines or cycloheximide.
[0255] The amount of mRNA to be electrophoresed was adjusted using the amount of glyceraldehyde-3-phosphate
dehydrogenase (GAPDH) mRNA as an index. DNA amplified by RT-PCR using GF primer and GR primer was
used as a probe for blotting of GAPDH mRNA (location of nucleotides: 566-1016, Genbank U5299) (Immunity, Vol. 9,
p. 1-10, 1998).

[0256] Figures 3 and 4 show the results.
[0257] The expression of mRNA for the novel protein 23C9 was extremely strong in mouse B cell clone CH12F3-2
stimulated with IL-4, TGF-β and CD40L, while the expression in unstimulated cells was extremely weak. Expression
of the mRNA in the stimulated cells was inhibited by the presence of the protein synthesis inhibitor. Moreover, in the
stimulated cells, two bands indicating the expression of mRNAs of different nucleotide lengths were detected.

[0258] Expression of mRNA for the novel protein 23C9 in each of the following mouse cell lines, which do not originally
undergo class switch recombination, was examined by Northern blotting in the same manner as above.

[0259] B cell lines (LyD9, BA/F3, 70Z/3, WEHI231), T cell lines (EL-4, 2B4), myeloma cell lines (X63, WEHI-3),
fibroblast lines (L929, NIH3T3), and other cell lines (P2, P815, ST2).

[0260] The expression of mRNA for the novel protein 23C9 was not observed in any of the cells.

Example 4: Cloning of a full length cDNA coding a novel protein 23C9

[0261] Four different positive clones were obtained by screening a cDNA library (Nucleic Acids Res., Vol. 26, No. 4,
p. 911-918, 1998) prepared from mouse B cell clone CH12F3-2 stimulated with IL-4, TGF-β, and CD40L, using the cDNA
fragment (1,020 bp) coding the novel protein 23C9 obtained in the above Example as a probe. The nucleotide sequence
of each clone was determined by using a DNA sequencer according to the standard method.
[0262] One clone comprises a 1.2 kb nucleotide length and a single reading frame (ORF) with 1 polyadenylation site.
The other 3 clones comprise a 2.4 kb nucleotide length and 2 polyadenylation sites. The nucleotide sequence of the 1.2
kb part on the 5' side of the latter clones was identical to that of the 1.2 kb DNA of the former (SEQ
ID NO: 1).

[0263] The two different mRNA transcripts detected in Northern blotting in the above Example (Figures 3 and 4) were
predicted to correspond to the above 1.2 kb and 2.4 kb transcripts, transcribed using the polyA site at the 3'
end and the polyA site at the 5' end.
[0264] The cDNA fragment coding the novel protein 23C9 used as a probe in the above (1,020 bp) was found to be the
nucleotide sequence of from 847 to 1866 in the full length cDNA of 23C9.
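
The stated probe length agrees with these coordinates when they are read as 1-based and inclusive, with the end position taken as 1866. A minimal sketch of that arithmetic follows; the helper names are hypothetical and the toy sequence is invented for illustration.

```python
def span_length(start, end):
    """Length of a region given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def subsequence(seq, start, end):
    """Extract a region from seq using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


print(span_length(847, 1866))         # length of the probe region
print(subsequence("ACGTACGT", 2, 4))  # toy example of coordinate handling
```

The same convention can be used to cut the probe region out of a full-length cDNA sequence held as a string.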

[0265] The nucleotide sequence near the first initiation codon in each cDNA conformed to Kozak's rule (Nucleic Acids Res.,
Vol. 15, No. 20, p. 8125-8148, 1987). In the 2.4 kb cDNA, ATTTA, which is a motif capable of mediating rapid degradation
of mRNA (Blood, Vol. 83, No. 11, p. 3182-3187, 1994), was present at 2 sites in the untranslated region on the 3' side.
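
A motif such as ATTTA can be located in a sequence with a simple scan. The sketch below is illustrative only: the function name is hypothetical and the example string is an invented toy sequence, not the 3' untranslated region of the 23C9 cDNA.

```python
def find_motif(seq, motif="ATTTA"):
    """Return the 1-based start positions of every occurrence of motif in seq,
    including overlapping occurrences."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start + 1)          # convert to 1-based numbering
        start = seq.find(motif, start + 1)   # resume one base later to allow overlaps
    return positions


toy_utr = "GGATTTACCATTTATTTAGG"  # invented sequence for illustration
print(find_motif(toy_utr))
```

Because the search resumes one base after each hit, overlapping copies of the motif (as in ATTTATTTA) are counted separately.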

[0266] The open reading frame (ORF) of the cDNA encoding the novel protein 23C9 consisted of 198 amino acids with an
expected molecular weight of about 24 kDa (SEQ ID NO: 2). A homology search against known proteins
in databases showed that the amino acid sequence of the ORF of the novel protein 23C9 had 34% amino acid homology with
apolipoprotein B mRNA editing enzyme, catalytic polypeptide-1 (APOBEC-1) (Science, Vol. 260, No. 5115, p.
1816-1819, 1993; J. Biol. Chem., Vol. 268, No. 28, p. 20709-20712, 1993). GenBank and EMBL were used as DNA
databases, and SwissProt was used as the protein database. The BLAST program (J. Mol. Biol., Vol. 215, No. 3, p. 403-410, 1990)
and the FASTA program (Proc. Natl. Acad. Sci. USA, Vol. 85, No. 8, p. 2444-2448, 1988) were used for the database searches.
[0267] Figure 5 shows the amino acid sequence of the ORF of the novel protein 23C9 and its alignment with the
amino acid sequence of mouse APOBEC-1.
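A percentage of matching residues over a pairwise alignment can be computed as sketched below. This is only an illustration: the text does not state which denominator or scoring scheme was used for the 34% figure, so the choice here (identical residues divided by aligned, non-gap columns) is an assumption, and the two short sequences are invented.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns (assumed convention)
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Invented toy alignment, not the sequences of Figure 5
toy_a = "ACDE-FGH"
toy_b = "ACNEQFGK"
print(round(percent_identity(toy_a, toy_b), 1))
```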

[0268] A motif search conducted online using PROSITE (Nucleic Acids Res., Vol. 11 No. 20, p. 2013-2018, 1992) showed that
the APOBEC-1-like novel protein 23C9 comprises a cytidine/deoxycytidine deaminase motif, which is conserved in the
amino acid sequences of proteins belonging to the large cytosine nucleoside/nucleotide deaminase family
and which is the active site for deaminase activity. The cytosine nucleoside/nucleotide deaminase family is
classified into RNA editing deaminases, cytidine/deoxycytidylate deaminases, and CMP/dCMP deaminases based on
substrate specificity and homology in the active sites (Cell, Vol. 81, No. 2, p. 187-195, 1995).
[0269] A phylogenetic tree was prepared based on the alignment among the corresponding regions of APOBEC-1, which is an RNA editing
deaminase, cytosine nucleoside deaminases, cytosine nucleotide deaminases, and the cytidine deaminase motif of the novel
protein 23C9. The sequences of the known proteins used for the comparison were obtained from GenBank, as follows.

Human derived nucleoside deaminase: L27943
Mouse derived nucleoside deaminase: AA388666
B. subtilis derived nucleoside deaminase: U18532
E. coli derived cytidine deaminase: X63144
Rabbit derived APOBEC-1: U10695
Human derived APOBEC-1: L25877
Rat derived APOBEC-1: U10695
Mouse derived APOBEC-1: U21951
T2/T4 phage derived nucleotide deaminase: J05172
Human derived nucleotide deaminase: L12136
S. cerevisiae derived nucleotide deaminase: U10397

[0270] Figure 6 shows the result. The cytidine deaminase motif of the novel protein 23C9 was more closely related to the subgroup
of RNA editing deaminases than to the subgroups of nucleoside deaminases and nucleotide deaminases.
[0271] In addition, a leucine-rich region at the C-terminus of APOBEC-1 is thought to be important
for protein-protein interaction (Proc. Natl. Acad. Sci. USA, Vol. 91, No. 16, p. 8522-8526, 1994; J. Biol. Chem., Vol.
269, No. 34, p. 21725-21734, 1994). The novel protein 23C9 also comprised a leucine-rich region at the C-terminus.
Four leucines in this region of 23C9 were conserved in the leucine-rich regions of rabbit, rat, mouse and
human APOBEC-1.

[0272] Phe66, Phe87, His61, Glu63 and Cys93 are known to be essential for the binding of APOBEC-1 to
RNA, and all of these amino acid residues were conserved in the primary structure of 23C9 (Trends Genet., Vol. 12, No.
10, p. 418-424, 1996; Cell, Vol. 81, No. 2, p. 187-195, 1995; J. Biol. Chem., Vol. 270, No. 24, p. 14768-14775, 1995; J.
Biol. Chem., Vol. 270, No. 24, p. 14762-14767, 1995). From this fact, the 23C9 protein is predicted to have an RNA
editing deaminase activity.

[0273] Moreover, APOBEC-1 and the cytidine deaminase derived from E. coli (ECCDA) are known to comprise a pseudoactive
site domain at the C-terminus, and the 23C9 protein also comprised a pseudoactive site domain like that of
APOBEC-1. This indicates that the 23C9 protein is more closely related to APOBEC-1 and ECCDA than to the deaminase proteins
in the other groups.

[0274] From these facts, the novel protein 23C9 was named activation-induced cytidine deaminase (AID). The novel
protein 23C9 is referred to as AID hereafter.

Example 5: Preparation of the AID-GST fusion protein 

[0275] The cDNA encoding full-length AID cloned in the above Example was amplified by PCR with a pair of primers,
AID-138 (SEQ ID NO: 3) and AID-161 (SEQ ID NO: 4), a pair of primers, AID-118 (SEQ ID NO: 5) and AID-119 (SEQ
ID NO: 6), and Taq polymerase by following the standard method. As there is an intron between AID-118 and AID-119,
a PCR product derived from AID genomic DNA can be easily distinguished.
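The reason an intron between the two primers allows genomic contamination to be recognized is that the genomic amplicon is longer than the cDNA amplicon by the length of the intron. A minimal sketch follows; the amplicon and intron sizes are hypothetical placeholders, since the text does not give them, and the function names are illustrative.

```python
def expected_amplicons(cdna_amplicon_bp: int, intron_bp: int) -> dict:
    """Expected PCR product sizes for an intron-spanning primer pair."""
    return {"cDNA": cdna_amplicon_bp, "genomic": cdna_amplicon_bp + intron_bp}


def classify_band(observed_bp: int, expected: dict, tolerance_bp: int = 20) -> str:
    """Assign an observed band to the template whose expected size it matches."""
    for template, size in expected.items():
        if abs(observed_bp - size) <= tolerance_bp:
            return template
    return "unassigned"


# Hypothetical sizes chosen only for illustration
expected = expected_amplicons(cdna_amplicon_bp=300, intron_bp=1000)
print(classify_band(305, expected))   # cDNA
print(classify_band(1295, expected))  # genomic
```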

[0276] The obtained PCR product was subcloned into the pGEX4T1 vector (Pharmacia) according to the standard method.
The nucleotide sequence of the vector was determined, and the absence of point mutations derived from the use of Taq
polymerase in the full-length AID cDNA cloned into the vector was confirmed.

[0277] E. coli DH5α was transformed with the vector according to the standard method. The obtained transformants
were cultured, and the full-length AID cDNA was expressed as a fusion protein with glutathione S-transferase (GST).
The AID-GST fusion protein was extracted in the same manner as in the previous report, and purified using glutathione
agarose affinity chromatography (J. Biol. Chem., Vol. 270, No. 24, p. 14768-14775, 1995).
[0278] The molecular weight of the purified AID-GST fusion protein was analyzed by following the standard method
using 10% SDS-PAGE and silver staining. Protein extracted from wild-type E. coli DH5α was used as a control.
Figure 7 shows the result.

[0279] As expected, the fusion protein was detected as a band with a molecular weight of about 49 kDa. Minor
bands detected below about 49 kDa were thought to be degradation products, which are frequently generated during
purification in general.

[0280] The molecular weight of the purified AID-GST fusion protein was also analyzed by the Western blotting method
according to the standard method (Genomics, Vol. 54, No. 1, p. 89-98, 1998). The anti-AID protein antibody used for
the assay was prepared by immunizing a commercially available experimental rabbit with multiple antigen peptides including
synthetic peptides corresponding to amino acid NOs: 116 to 132 of the AID protein of the present invention (Proc. Natl.
Acad. Sci. USA, Vol. 85, No. 15, p. 5409, 1988).
[0281] Figure 8 shows the result.

Example 6: Cytidine deaminase activity of the AID protein 

[0282] The cytidine deaminase activity of AID was measured by the same method as in the previous report (J. Biol.
Chem., Vol. 270, No. 24, p. 14768-14775, 1995).

[0283] The purified AID-GST fusion protein prepared above (2, 4, 6, 8, 10, 20, 40, 60, 100, 200, 300, 400, and
600 ng) was incubated in buffer (pH 7.5, total volume 10 μl) containing 45 mM Tris with 3.3 μCi [3H]deoxycytidine
(24.8 Ci/mmol, DuPont) and 250 μM cytidine for 2 to 4 hours. The reaction was terminated by adding deoxycytidine
(2 μl of 10 μg/ml) and deoxyuridine (2 μl of 10 μg/ml). Insoluble substances were removed by centrifugation, and the
reaction mixture (4 μl) was applied to a polyethyleneimine-cellulose thin layer chromatography plate (VWR). The
plate was developed in isopropyl alcohol / 10% HCl (7:2 v/v). The plate was exposed to ultraviolet light (254 nm) for
visualization, and the bands corresponding to deoxycytidine and deoxyuridine were collected and added to Ultima Gold
scintillation solution to be quantified with a liquid scintillation photometer (Packard).
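One way to turn the scintillation counts of the two bands into a deaminase activity readout is to express the deoxyuridine counts as a fraction of the total recovered counts. The sketch below assumes this calculation and an optional background subtraction; the text does not specify how the counts were converted, and the numbers are invented.

```python
def percent_deaminated(dc_counts: float, du_counts: float, background: float = 0.0) -> float:
    """Percent of labeled substrate recovered as deoxyuridine (product)."""
    dc = max(dc_counts - background, 0.0)  # remaining substrate band
    du = max(du_counts - background, 0.0)  # product band
    total = dc + du
    if total == 0:
        raise ValueError("no counts above background")
    return 100.0 * du / total


# Invented counts for illustration only
print(percent_deaminated(dc_counts=9000.0, du_counts=1000.0))  # 10.0
```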

[0284] Figure 9 shows the result. The AID protein showed cytidine deaminase activity in a concentration-dependent
manner.

[0285] The inhibitory effect of tetrahydrouridine (THU; 0 to 40 μM) (Calbiochem, USA), an inhibitor specific
to cytidine deaminase, on the cytidine deaminase activity of the AID-GST fusion protein (300 ng) was measured by
the same method as described above.
[0286] Figure 10 shows the result. The cytidine deaminase activity of the AID protein was inhibited in a manner dependent on the
concentration of THU.

[0287] The inhibitory effects of 1,10-o-phenanthroline (0 to 20 mM), a zinc-chelating agent, and of its inactive
isomer 1,7-o-phenanthroline (0 to 20 mM) on the cytidine deaminase activity of the AID-GST fusion protein were
measured in the same manner as described above.
[0288] Figure 11 shows the result. The cytidine deaminase activity of the AID protein was inhibited by about 91% by 20 mM
1,10-o-phenanthroline, whereas the inactive isomer 1,7-o-phenanthroline inhibited the activity by only about 13%. These results
indicate that the AID protein is a zinc-dependent cytidine deaminase, like APOBEC-1.
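Percent inhibition of this kind is commonly expressed relative to the activity measured without inhibitor. The sketch below assumes that definition; the activity values are invented and merely chosen so that the output resembles the reported figures.

```python
def percent_inhibition(control_activity: float, treated_activity: float) -> float:
    """Inhibition relative to the uninhibited control, in percent."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (control_activity - treated_activity) / control_activity


# Invented activities (arbitrary units), for illustration only
print(percent_inhibition(100.0, 9.0))   # 91.0
print(percent_inhibition(100.0, 87.0))  # 13.0
```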
Example 7: Avidity of AID protein with AU-rich RNA

[0289] Recombinant APOBEC-1 binds to AU-rich RNA (Trends Genet., Vol. 12, No. 10, p. 418-424, 1996; Cell, Vol.
81, No. 2, p. 187-195, 1995; J. Biol. Chem., Vol. 270, No. 24, p. 14768-14775, 1995; J. Biol. Chem., Vol. 270, No. 24,
p. 14762-14767, 1995), and carries out RNA editing of apoB in the presence of a chicken extract containing a cofactor.
[0290] Since the AID protein has a functional cytidine deaminase activity as well as structural similarity to
APOBEC-1, its avidity to AU-rich RNA (5-AU) and apoB RNA, which are RNA substrates for APOBEC-1, was examined
in order to assess whether the AID protein has an RNA editing activity.

[0291] The AID protein did not show avidity to AU-rich RNA (5-AU) in the gel retardation assay. In the in vitro apoB RNA
assay, conversion from cytidine (C) to uridine (U) was not observed.

Example 8: Expression distribution of AID mRNA in tissues 

[0292] The expression of AID mRNA in each tissue was examined by Northern blotting according to the standard
method (J. Sambrook, E. F. Fritsch, T. Maniatis, Molecular Cloning, Second edition (Nolan, C., Ed.), Cold Spring Harbor,
1989; Experimental Medicine, Suppl., "Genetic Engineering Hand Book", published by Yodosha, p. 133-140, 1992).
[0293] PolyA+ RNA (2 μg each) obtained from cells derived from each mouse tissue (muscle, spleen, lung, heart,
lymph node, brain, kidney, thymus, testis, liver) according to the previous report (Nucleic Acids Res., Vol. 26, No. 4,
p. 911-918, 1998) was used as a sample. The radiolabeled cDNA fragment (1,020 bp) encoding AID (23C9) obtained in the
previous Examples was used as a probe for blotting the polyA+ RNA.

[0294] As a control, mRNA of glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was blotted in the same manner.
As a probe for blotting GAPDH mRNA, DNA amplified by PCR using the GF primer and GR primer was used (nucleotide
location: 566-1016, GenBank U52599) (Immunity, Vol. 9, p. 1-10, 1998).
[0295] Figure 12 shows the result.
[0296] AID mRNA was strongly expressed in mesenteric lymph node. In addition, weak expression was
observed in spleen.

Example 9: Expression of AID mRNA in various lymphatic tissues

[0297] The expression of AID mRNA in each lymphatic tissue was analyzed by RT-PCR according to the standard
method (Immunity, Vol. 9, p. 1-10, 1998).

[0298] cDNA was prepared according to the standard method using, as a template, polyA+ RNA obtained from cells derived from
various lymphatic tissues (Peyer's patch, mesenteric lymph node, axillary lymph node, spleen, bone marrow, thymus)
in the same manner as in the previous report (Nucleic Acids Res., Vol. 26, No. 4, p. 911-918, 1998).
AID cDNA and GAPDH cDNA were amplified using the obtained cDNA as a template. The pair
of primers AID-138 (SEQ ID NO: 3) and AID-161 (SEQ ID NO: 4) described above, the pair of primers AID-118 (SEQ ID
NO: 5) and AID-119 (SEQ ID NO: 6), and Taq polymerase were used for PCR of AID cDNA. As there is an intron between
AID-118 and AID-119, a PCR product derived from the AID genomic DNA sequence can be easily distinguished.
[0299] Figure 13 shows the result.

[0300] AID cDNA was detected in all lymphatic tissues except for thymus. In particular, clear expression was
observed in peripheral lymphatic organs, such as lymph node and Peyer's patch. On the other hand, the expression in
primary lymphatic organs was weak in comparison with that in the peripheral lymphatic organs.

Example 10: Time course of AID mRNA expression in activated mouse B cell clone CH12F3-2

[0301] The time course of AID mRNA expression in activated mouse B cell clone CH12F3-2 stimulated with IL-4, TGF-β,
and CD40L for 0 to 60 hours was analyzed by Northern blotting according to the standard method (J. Sambrook,
E. F. Fritsch, T. Maniatis, Molecular Cloning, Second edition (Nolan, C., Ed.), Cold Spring Harbor, 1989).
[0302] Mouse B cell clone CH12F3-2 was cultured in the presence of IL-4, TGF-β, and CD40L for various periods
(0, 3, 5, 12, 24, 36, 48 or 60 hours).

[0303] Northern blotting was conducted on mRNA (10 μg in each group) obtained from each culture group in
the same manner as in the previous report (Nucleic Acids Res., Vol. 26, No. 4, p. 911-918, 1998) using the radiolabeled
cDNA fragment encoding AID (23C9) obtained in the previous Examples as a probe, according to the standard method
(J. Sambrook, E. F. Fritsch, T. Maniatis, Molecular Cloning, Second edition (Nolan, C., Ed.), Cold Spring Harbor, 1989).

[0304] The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA of GAPDH as an index. DNA
amplified by RT-PCR using the GF primer and GR primer was used as a probe for blotting GAPDH mRNA (nucleotide
location: 566-1016, GenBank U52599) (Immunity, Vol. 9, p. 1-10, 1998).
[0305] Figure 14 shows the result.
[0306] It was shown that the expression of AID mRNA in mouse B cell clone CH12F3-2 was too small to be detected
without stimulation by the cytokines, but that the expression was initiated 3 hours after the stimulation by the cytokines
(described above), reached a maximum 12 hours after the stimulation (more than about 15 times), and gradually
decreased from 48 hours after the stimulation.
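A fold change of this kind is typically obtained by normalizing each AID band intensity to the loading control (here GAPDH) and dividing by a reference time point. The sketch below assumes that procedure; the text does not describe the quantification in detail, the choice of reference lane is left to the user, and the intensities are invented.

```python
def fold_induction(target: list, loading: list, reference_index: int = 0) -> list:
    """Loading-normalized signal of each lane relative to a reference lane."""
    if len(target) != len(loading):
        raise ValueError("target and loading lists must have the same length")
    normalized = [t / l for t, l in zip(target, loading)]
    reference = normalized[reference_index]
    if reference == 0:
        raise ValueError("reference lane has no detectable signal")
    return [value / reference for value in normalized]


# Invented band intensities (arbitrary units), for illustration only
aid_signal = [2.0, 30.0, 12.0]
gapdh_signal = [1.0, 1.0, 1.0]
print(fold_induction(aid_signal, gapdh_signal))  # [1.0, 15.0, 6.0]
```

Because an undetectable baseline cannot serve as the denominator, the sketch raises an error when the reference lane is zero rather than returning a misleading value.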

Example 11: Cytokine specificity for inducing expression of AID mRNA in mouse B cell clone CH12F3-2

[0307] Cytokine specificity for inducing expression of AID mRNA in mouse B cell clone CH12F3-2 was analyzed by
Northern blotting according to the standard method (J. Sambrook, E. F. Fritsch, T. Maniatis, Molecular
Cloning, Second edition (Nolan, C., Ed.), Cold Spring Harbor, 1989).

[0308] Mouse B cell clone CH12F3-2 was cultured in the presence of various combinations of cytokines (one or more
selected from IL-4, TGF-β, and CD40L) for 12 hours. Northern blotting was conducted on mRNA (10 μg in each group)
obtained from each culture group in the same manner as in the previous report (Nucleic Acids Res., Vol. 26, No. 4, p.
911-918, 1998) using the radiolabeled cDNA fragment (1,020 bp) encoding AID (23C9) obtained in the previous Example,
according to the standard method (J. Sambrook, E. F. Fritsch, T. Maniatis, Molecular Cloning, Second edition (Nolan,
C., Ed.), Cold Spring Harbor, 1989).

[0309] The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA of GAPDH as an index. DNA
amplified by RT-PCR using the GF primer and GR primer was used as a probe for blotting GAPDH mRNA (nucleotide
location: 566-1016, GenBank U52599) (Immunity, Vol. 9, p. 1-10, 1998).
[0310] Figure 15 shows the result.

[0311] Induction of AID mRNA expression by any one of the cytokines alone was small. On the other hand, when
the three cytokines described above were used at the same time, the maximum induction of AID mRNA expression
was observed.

[0312] As described in Example 3 above, because the induction of AID mRNA expression was inhibited by cycloheximide,
an inhibitor of protein synthesis, it is hypothesized that the enhanced expression of AID mRNA requires de
novo protein synthesis.

Example 12: Induction of AID mRNA expression in spleen B cells by stimulation

[0313] The presence or absence of induction of AID mRNA expression by stimuli that may activate B cells and
induce class switch recombination of immunoglobulin was examined.

[0314] Spleen B cells were purified from BALB/c mice (6 to 12 weeks old, Shimizu Experimental Materials
(SLC)) according to the standard method. Dead cells and cell fragments were removed by Ficoll density gradient
centrifugation after removal of T cells. The purified spleen B cells were cultured for 4 days in the presence
of stimuli in various combinations (one or more selected from IL-4, TGF-β, CD40L and LPS (lipopolysaccharide))
in the same manner as in the previous report (Nucleic Acids Res., Vol. 26, No. 4, p. 911-918, 1998). LPS derived from
Salmonella typhosa (50 μg/ml, Sigma) was used.

[0315] Northern blotting was conducted on mRNA (15 μg in each group) obtained from each culture group in
the same manner as in the previous report (Nucleic Acids Res., Vol. 26, No. 4, p. 911-918, 1998) using the radiolabeled
cDNA fragment encoding AID (23C9) obtained in the previous Example, according to the standard method (J. Sambrook,
E. F. Fritsch, T. Maniatis, Molecular Cloning, Second edition (Nolan, C., Ed.), Cold Spring Harbor, 1989).
[0316] The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA of GAPDH and 28S ribosomal
RNA as an index. DNA amplified by RT-PCR using the GF primer and GR primer was used as a probe for blotting GAPDH
mRNA (nucleotide location: 566-1016, GenBank U52599) (Immunity, Vol. 9, p. 1-10, 1998).

[0317] Figure 16 shows the result.

[0318] Enhanced expression of AID mRNA upon stimulation with LPS alone, LPS+IL-4, or LPS+TGF-β was
observed in normal mouse spleen B cells.

Example 13: Induced expression of AID mRNA in vivo 


[0319] It was examined whether the induction of AID mRNA expression by various stimuli in vitro would also occur
in vivo.

[0320] BALB/c mice (6 to 12 weeks old, five individuals in each group, SLC) were immunized by intraperitoneally
administering sheep red blood cells (SRBC) (1X10^ cells, Cosmo Bio.). In a living body immunized with SRBC, it is
known that clonal expansion and germinal center formation occur after the immune response, and that class switch
recombination of immunoglobulin genes and affinity maturation are induced.

[0321] PolyA+ RNA was prepared from splenocytes isolated from spleen excised from each individual before (day 0)
and after (days 2, 5, and 13) the immunization.
[0322] The polyA+ RNA (2 μg each) was subjected to Northern blotting using the radiolabeled cDNA fragment
(1,020 bp) encoding AID (23C9) as a probe in the same manner as in the above Examples. The amount of mRNA to be
gel-electrophoresed was adjusted using mRNA of GAPDH as an index in the same manner as in the above Examples.
[0323] Figure 17 shows the result.
[0324] Only a minimal amount of AID mRNA expression was detected before immunization with SRBC (day 0); however,
a significant enhancement of expression (about 4 to 5 times) was observed on day 5 and day 13 after the immunization.
[0325] Moreover, to analyze in which cell type the enhanced expression of AID mRNA occurs, RT-PCR was conducted
by the standard method (Immunity, Vol. 9, p. 1-10, 1998).

[0326] Red blood cells were removed from splenocytes obtained from spleen excised 5 days after the
immunization with SRBC in the same manner as above, and T cells and non-T cells were separated using nylon
fiber (Wako Pure Chemicals) in the same manner as in the previous report (Eur. J. Immunol., Vol. 3, No. 10, p. 645-649,
1973). The T cell fraction contained more than 90% CD3-positive cells and less than 20% B220-positive cells. The T-cell
fraction (removal of B cells) and the B-cell fraction were concentrated by the MACS method with magnetic beads conjugated
to anti-CD19 antibody (Miltenyi Biotech.). B220-positive B cells in the fraction from which T cells were removed
accounted for 5% or less. On the other hand, B220-positive B cells in the fraction in which CD19-positive cells were
concentrated accounted for 60% or more.

[0327] cDNA was prepared with reverse transcriptase according to the standard method using polyA+ RNA prepared
from each fractionated cell group. AID cDNA and GAPDH cDNA were amplified by PCR using the obtained cDNA as
a template. For PCR of AID cDNA, the previously described pair of primers, AID-138 (SEQ ID NO: 3) and AID-161
(SEQ ID NO: 4), and the previously described pair of primers, AID-118 (SEQ ID NO: 5) and AID-119 (SEQ ID NO: 6),
as well as Taq polymerase were used.
[0328] Figure 18 shows the result.

[0329] As a result, amplification of AID cDNA was observed in the CD19-positive B cell fraction and the non-T cell fraction.
Specifically, it was demonstrated that the enhanced expression of AID mRNA induced by immunization with SRBC occurs
in spleen CD19-positive B cells.

Example 14: Localization of AID mRNA expression in lymphatic organs 

[0330] The results of the previous Examples showed that the timing of the enhanced expression of AID mRNA in spleen was almost
consistent with the initiation of germinal center (GC) formation after immunization with SRBC. In this examination,
the precise localization of AID mRNA expression in lymphatic organs was analyzed using in situ hybridization.
[0331] AID cDNA excised by digesting the pGEX4T1 vector, into which the cDNA encoding the AID protein had been subcloned,
with EcoRI and XhoI was subcloned into plasmid pBluescript SK(+) (Stratagene). The plasmid was digested with EcoRI
or XhoI to obtain linearized plasmid DNA, which was used as a template for transcription into RNA in the presence
of digoxigenin-labeled rUTP (Boehringer-Mannheim) using T3 RNA polymerase or T7 RNA polymerase to prepare
digoxigenin-labeled antisense and sense probes.
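The relationship between the two probes can be illustrated in a few lines: an antisense riboprobe is the reverse complement of the mRNA and can therefore hybridize to it, whereas a sense probe has the same sequence as the mRNA and serves as a negative control. The sketch below uses an invented six-nucleotide sequence and a hypothetical function name.

```python
# Translation table for RNA base pairing (A-U, G-C)
_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(sense_rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5' to 3'."""
    return sense_rna.upper().translate(_RNA_COMPLEMENT)[::-1]


# Invented toy mRNA fragment, for illustration only
toy_mrna = "AUGGAC"
print(antisense_rna(toy_mrna))  # GUCCAU
```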

[0332] Separately, frozen tissue slices fixed with paraformaldehyde were prepared from each of
spleen and Peyer's patch of a normal mouse as lymphatic organ samples. In addition, a normal mouse was immunized with SRBC
in the same manner as in the above Examples, and frozen tissue slices fixed with paraformaldehyde were prepared from spleen
obtained 5 days after the immunization.

[0333] Hybridization was conducted by applying the digoxigenin-labeled antisense AID probe or sense AID probe
to each of the slides carrying the fixed slices. Hybridized digoxigenin-labeled AID probe was detected
using anti-digoxigenin antibody conjugated with alkaline phosphatase. The localization of the anti-digoxigenin antibody
bound to digoxigenin on the probe was identified by detecting the phosphatase reaction product (dark purple color). This
analysis was conducted using a light transmission microscope.

[0334] In situ hybridization and detection of the riboprobe in this examination were conducted in the same manner as in
the previous report (J. Comp. Neurol., Vol. 333, No. 3, p. 398-416, 1993).

[0335] The location of the germinal center in each tissue slice was identified by staining with PNA (Vector) conjugated
with FITC and observing with a fluorescence microscope.

[0336] Figures 19 and 20 show the result.

[0337] In the examination using the antisense AID probe, multiple distinct focal signals were observed in spleen
tissue slices derived from the SRBC-immunized mouse (day 5 after the immunization) (Figures 19 (E) and 20 (E)); however,
no signals were detected in spleen tissue slices derived from mice that were not immunized with SRBC (Figures
19 (B) and 20 (B)). This result is consistent with the result of Northern blotting obtained in the above Example (Figure
17). Germinal centers were observed both in spleen tissue slices derived from the SRBC-immunized mouse
(day 5 after the immunization) (Figures 19 (F) and 20 (F)) and in the normal Peyer's patch (Figure 20 (I)) by
staining with FITC-labeled PNA. The expression of AID mRNA was found to localize to the germinal centers in both
tissue slices.
[0338] In the examination using the sense AID probe, no background signals were detected in tissue slices of
spleen or of Peyer's patch regardless of the presence or absence of immunization with SRBC.
[0339] This result indicates that induction of AID mRNA expression occurs specifically in germinal center B cells
activated by stimulation with an antigen.


Example 15: Isolation of genomic DNA coding AID protein derived from human 
<15-1> Preparation of probes for hybridization 

[0340] PCR was conducted using, as a template, the expression vector prepared in Example 5 by inserting the cDNA encoding the
full-length mouse AID protein into the plasmid vector pGEX4T1, with a pair of primers (primer 170: SEQ
ID NO: 16 and primer 181: SEQ ID NO: 17), according to the standard method described above.
[0341] The obtained PCR product was purified by the standard method described above, and the nucleotide sequence
of the purified DNA was determined by the direct sequencing method to confirm that the purified DNA has the nucleotide
sequence encoding the full-length mouse AID protein. This purified DNA was used as a probe for hybridization in the
following experiments.

<15-2> Screening of human genomic DNA library 

[0342] The probe prepared above was labeled in the same manner as the radioactive probe used in the above
Northern hybridization to make a probe labeled with a radioisotope.

[0343] Using the labeled probe, a commercial human genomic DNA library (catalogue No. HL1067j; Lot No. 45003;
CLONTECH) was screened by cross hybridization according to the standard method.

[0344] Washing after the hybridization was conducted twice in 2× SSC (containing 0.1% SDS, at room
temperature, 10 min) and twice in 2× SSC (containing 0.1% SDS, 65°C, 30 min). Phage DNA was purified, and
the genomic DNA of about 22 kb obtained by cleaving at the NotI restriction enzyme site in the phage
DNA was subcloned into the NotI restriction enzyme site of plasmid pZero-2.1. This plasmid was named SCpZero.

[0345] A DNA fragment obtained by digesting SCpZero with PstI was ligated to the PstI site of plasmid pBluescript
KS (Toyobo), and E. coli was transformed with this ligated DNA.
[0346] Transformants were screened by colony hybridization using the labeled probe prepared above
according to the standard method, and multiple positive clones were obtained.

[0347] The nucleotide sequence of the human genomic DNA inserted into each positive clone was analyzed, and multiple
clones containing genomic DNA comprising DNA encoding human AID protein were identified.

[0348] Among the multiple clones, the nucleotide sequences of the genomic DNA containing DNA encoding human AID protein
contained in two clones are shown in SEQ ID NOs: 9 and 10, respectively.

[0349] Moreover, the nucleotide sequence of the genomic DNA including DNA encoding human AID protein contained in
another positive clone is shown in SEQ ID NO: 35.

Example 16: Isolation of cDNA coding a full-length human AID protein and preparation of human AID protein

[0350] By comparing a nucleotide sequence of genomic DNA including a coding region of the obtained human AID 
protein with cDNA nucleotide sequence coding a full-length mouse AID protein determined in the above, a human AID 
protein coding region in the human genomic DNA was deduced. 

[0351] A pair of primers for RACE-PCR was designed based on the deduced nucleotide sequence of the coding
region of the human AID protein (primer 22: SEQ ID NO: 18, and primer 25: SEQ ID NO: 19).

[0352] RACE-PCR was conducted using mRNA prepared from the human B lymphoma cell line RAMOS as a template
with the above pair of primers according to the previous report (J. Biol. Chem., Vol. 274, p. 18470-18476, 1999) by
following the standard method. The nucleotide sequence of the obtained PCR product was determined, and cDNA encoding
the full-length human AID protein was obtained (cDNA sequence: SEQ ID NO: 7, and amino acid sequence: SEQ ID NO: 8).
[0353] As a result, the human AID protein (SEQ ID NO: 8) was found to have extremely high homology in amino acid sequence with
the mouse AID protein (SEQ ID NO: 2) (Figure 22). The amino acid sequences of the cytidine and deoxycytidylate deaminase
zinc-binding region, which is the active region of the AID protein (amino acid NOs: 56 to 94 in both mouse AID and human AID),
were completely identical (conserved) between mouse and human.

[0354] As the partial amino acid sequence (amino acid NOs: 116 to 132 in SEQ ID NO: 2) of the mouse AID protein used
for the preparation of the anti-AID protein antibody (Example 5) was completely identical to the corresponding amino
acid sequence (amino acid NOs: 116 to 132 in SEQ ID NO: 8) of the human AID protein, the anti-AID protein antibody was
expected to cross-react not only with the mouse AID protein but also with the human AID protein.
[0355] The human AID cDNA obtained above was reconstructed by genetic engineering according to the standard method
so that a His-AID fusion protein, in which a His-tag (a peptide of histidine repeated 10 times) is added to the
N-terminus of the human AID protein, would be produced, and an expression vector was prepared by inserting the cDNA
into the plasmid pEF-BOS (Unexamined published Japanese patent No. Hei 2-242687). The vector was introduced into
the monkey kidney derived cell line COS7 by lipofection using LIPOFECTAMINE (GIBCO BRL) according to the standard
method. The obtained transgenic cells were cultured by the standard method, and the His-human AID fusion protein was
transiently expressed. The His-human AID fusion protein was extracted and purified by the same method as in the previous
report, and the production of the His-human AID fusion protein was analyzed by Western blotting with the anti-AID antibody
prepared in Example 5 and a commercial anti-His tag antibody according to the standard method. As a result, the
His-AID protein was detected as a band with a molecular weight of about 31 kDa with either antibody.

Example 17: Determination of exons in genomic DNA coding human AID protein

[0356] Based on the information on the nucleotide sequence of the cDNA encoding the full-length human AID protein above,
the exons in the nucleotide sequences of the genomic DNA encoding the human AID protein above were determined.
[0357] As a result, the gene was confirmed to consist of 5 exons.

Exon 1 (Nucleotide sequence: SEQ ID NO: 11);
Exon 2 (Nucleotide sequence: SEQ ID NO: 12);
Exon 3 (Nucleotide sequence: SEQ ID NO: 13);
Exon 4 (Nucleotide sequence: SEQ ID NO: 14); and
Exon 5 (Nucleotide sequence: SEQ ID NO: 15).

[0358] Exon 1 contains the translation initiation codon ATG, which encodes the first methionine (amino acid NO: 1 of
SEQ ID NO: 8) of the human AID protein, and the initiation codon corresponds to nucleotide NOs: 80 to 82 in SEQ ID NO: 11.
[0359] Specifically, the genomic DNA including DNA encoding human AID obtained in the above Examples (SEQ ID
NO: 9, SEQ ID NO: 10 and SEQ ID NO: 35) consists of the introns and exons described below and has a full length
of about 11 kb. Figure 23 schematically shows the structure.

<SEQ ID NO: 9>

[0360]

Intron: Nucleotide NOs: from 1 to 1031
Exon 1: Nucleotide NOs: from 1032 to 1118
Intron: Nucleotide NOs: from 1119 to 5514

<SEQ ID NO: 10>

[0361]

Intron: Nucleotide NOs: from 1 to 1064
Exon 2: Nucleotide NOs: from 1065 to 1212
Intron: Nucleotide NOs: from 1213 to 2591
Exon 3: Nucleotide NOs: from 2592 to 2862
Intron: Nucleotide NOs: from 2863 to 3155
Exon 4: Nucleotide NOs: from 3156 to 3271
Intron: Nucleotide NOs: from 3272 to 3740
Exon 5: Nucleotide NOs: from 3741 to 5912
Intron: Nucleotide NOs: from 5913 to 6564
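The coordinates listed for SEQ ID NO: 10 can be checked for internal consistency. The sketch below assumes 1-based, inclusive numbering, verifies that each segment begins immediately after the preceding one ends, and reports the exon lengths; the data structure and function names are illustrative only.

```python
# (label, first nucleotide, last nucleotide) as listed for SEQ ID NO: 10
SEGMENTS = [
    ("Intron", 1, 1064),
    ("Exon 2", 1065, 1212),
    ("Intron", 1213, 2591),
    ("Exon 3", 2592, 2862),
    ("Intron", 2863, 3155),
    ("Exon 4", 3156, 3271),
    ("Intron", 3272, 3740),
    ("Exon 5", 3741, 5912),
    ("Intron", 5913, 6564),
]


def is_contiguous(segments: list) -> bool:
    """True if every segment starts one nucleotide after the previous one ends."""
    return all(nxt[1] == prev[2] + 1 for prev, nxt in zip(segments, segments[1:]))


def exon_lengths(segments: list) -> dict:
    """Length in nucleotides of each exon (inclusive coordinates)."""
    return {
        name: end - start + 1
        for name, start, end in segments
        if name.startswith("Exon")
    }


print(is_contiguous(SEGMENTS))  # True
print(exon_lengths(SEGMENTS))
```

The same check can be applied to the coordinate lists of the other sequences to catch transcription errors in segment boundaries.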


<SEQ ID NO: 35>

[0362]

Intron: Nucleotide NOs: from 1 to 441
Exon 1: Nucleotide NOs: from 442 to 528
Intron: Nucleotide NOs: from 529 to 6279
Exon 2: Nucleotide NOs: from 6280 to 6427
Intron: Nucleotide NOs: from 6428 to 7806
Exon 3: Nucleotide NOs: from 7807 to 8077
Intron: Nucleotide NOs: from 8078 to 8370
Exon 4: Nucleotide NOs: from 8371 to 8486
Intron: Nucleotide NOs: from 8487 to 8955
Exon 5: Nucleotide NOs: from 8956 to 11067
Intron: Nucleotide NOs: from 11068 to 11204
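
Coordinate lists of this kind can be checked mechanically. The following is a minimal sketch, assuming the feature boundaries listed for SEQ ID NO: 10 and assuming that the final intron runs to the last nucleotide of that sequence (6564). The function and variable names are illustrative only and are not part of the original disclosure. It checks that consecutive features abut without gaps or overlaps and reports the length of each exon.

```python
# Features of SEQ ID NO: 10 as (label, first, last); coordinates are
# 1-based and inclusive, transcribed from the list in the text.
FEATURES_SEQ_ID_10 = [
    ("intron", 1, 1064),
    ("exon 2", 1065, 1212),
    ("intron", 1213, 2591),
    ("exon 3", 2592, 2862),
    ("intron", 2863, 3155),
    ("exon 4", 3156, 3271),
    ("intron", 3272, 3740),
    ("exon 5", 3741, 5912),
    ("intron", 5913, 6564),
]

# Assumption: the sequence ends where the last listed intron ends.
ASSUMED_LENGTH = 6564


def tiles_without_gaps(features, total_length):
    """Return True if the features cover 1..total_length contiguously."""
    expected_first = 1
    for _label, first, last in features:
        if first != expected_first or last < first:
            return False
        expected_first = last + 1
    return expected_first == total_length + 1


def exon_lengths(features):
    """Map each exon label to its length in nucleotides."""
    return {
        label: last - first + 1
        for label, first, last in features
        if label.startswith("exon")
    }


if __name__ == "__main__":
    print(tiles_without_gaps(FEATURES_SEQ_ID_10, ASSUMED_LENGTH))
    for label, length in exon_lengths(FEATURES_SEQ_ID_10).items():
        print(f"{label}: {length} nt")
```

Applied to the lists for SEQ ID NO: 9 or SEQ ID NO: 35, the same check would flag any boundary that does not follow directly from the end of the preceding feature.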

Example 18: Amplification of a given partial nucleotide sequence of genomic DNA coding human AID protein by PCR
and diagnosis of the presence or absence of mutation in the partial nucleotide sequence

[0363] The AID protein of the present invention may be involved in the onset of various immunodeficiencies and allergic
diseases. For example, a given immunodeficiency or allergic disease may be caused by a mutation or deletion in the
nucleotide sequence of the genomic DNA (especially an exon) coding an AID protein.
[0364] The presence or absence of such a mutation or deletion in genomic DNA can be analyzed by, for example,
the following procedure.

(1) A pair of primers comprising nucleotide sequences complementary to a given partial nucleotide sequence of
the genomic DNA coding the AID protein of the present invention is prepared.

(2) Using genomic DNA coding AID protein obtained from tissues or cells of a patient suffering from immunodeficiency
or allergic disease as a template, the target partial nucleotide sequence of the genomic DNA is amplified
with the pair of primer DNAs.

(3) By analyzing the presence or absence of a PCR product and the nucleotide sequence of the PCR product, and
comparing the nucleotide sequence with the corresponding nucleotide sequence in genomic DNA coding AID protein
derived from a normal person, a mutation or deletion in the genomic DNA is identified.
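
The comparison in step (3) can be illustrated in code. The following is a minimal sketch, not the procedure used in this Example: it uses Python's standard-library `difflib.SequenceMatcher` as a simple stand-in for a sequence alignment tool, and the function name and the short example strings are invented for illustration. It reports where a sample sequence differs from a reference sequence.

```python
from difflib import SequenceMatcher


def compare_to_reference(reference, sample):
    """List differences between a reference and a sample sequence.

    Returns tuples (kind, position, reference_bases, sample_bases), where
    position is the 1-based coordinate in the reference.
    """
    matcher = SequenceMatcher(None, reference, sample, autojunk=False)
    kinds = {"replace": "substitution",
             "delete": "deletion",
             "insert": "insertion"}
    differences = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            continue
        differences.append((kinds[tag], i1 + 1, reference[i1:i2], sample[j1:j2]))
    return differences


if __name__ == "__main__":
    # Short strings invented for illustration; they are not patient data.
    normal = "ATGGACAGCCTC"
    point_mutant = "ATGGACAGTCTC"   # one base changed
    deletion_mutant = "ATGGAGCCTC"  # two bases missing
    print(compare_to_reference(normal, point_mutant))
    print(compare_to_reference(normal, deletion_mutant))
```

For real PCR products a dedicated alignment program would normally be used. The sketch only shows the logic of classifying each difference as a substitution, deletion, or insertion relative to the sequence from a normal person.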

[0365] Specifically, this method makes it possible not only to elucidate the relationship between immunodeficiency or
allergic disease and the AID protein, but also to diagnose such diseases in cases where the AID protein is
a cause of the onset of a given type of disease (for example, immunodeficiency or allergic disease).
[0366] For the above purpose, the following 15 kinds of primers were designed and prepared based on given partial
nucleotide sequences in the genomic DNA coding human AID protein.

Primer: p3 (SEQ ID No. 20)
Primer: p9 (SEQ ID No. 21)
Primer: p10 (SEQ ID No. 22)
Primer: p12 (SEQ ID No. 23)
Primer: p14 (SEQ ID No. 24)
Primer: p16 (SEQ ID No. 25)
Primer: p17 (SEQ ID No. 26)
Primer: p19 (SEQ ID No. 27)
Primer: p26 (SEQ ID No. 28)
Primer: p29 (SEQ ID No. 29)
Primer: p36 (SEQ ID No. 30)
Primer: p48 (SEQ ID No. 31)
Primer: p59 (SEQ ID No. 32)
Primer: p85 (SEQ ID No. 33)
Primer: p86 (SEQ ID No. 34)

[0367] By PCR using the above primers as primer pairs in the following combinations, with genomic DNA
isolated from human B lymphoma cells (RAMOS) as a template, each target partial nucleotide sequence of the genomic DNA coding human
AID protein was amplified. Figure 21 shows the relative locations of the genomic DNA partial nucleotide sequences amplified
by each primer pair.

(1) DNA comprising the nucleotide sequence of SEQ ID NO: 31 and DNA comprising the nucleotide sequence of SEQ ID NO: 32;
(2) DNA comprising the nucleotide sequence of SEQ ID NO: 20 and DNA comprising the nucleotide sequence of SEQ ID NO: 22;
(3) DNA comprising the nucleotide sequence of SEQ ID NO: 21 and DNA comprising the nucleotide sequence of SEQ ID NO: 30;
(4) DNA comprising the nucleotide sequence of SEQ ID NO: 24 and DNA comprising the nucleotide sequence of SEQ ID NO: 25;
(5) DNA comprising the nucleotide sequence of SEQ ID NO: 23 and DNA comprising the nucleotide sequence of SEQ ID NO: 27;
(6) DNA comprising the nucleotide sequence of SEQ ID NO: 23 and DNA comprising the nucleotide sequence of SEQ ID NO: 28;
(7) DNA comprising the nucleotide sequence of SEQ ID NO: 23 and DNA comprising the nucleotide sequence of SEQ ID NO: 29;
(8) DNA comprising the nucleotide sequence of SEQ ID NO: 26 and DNA comprising the nucleotide sequence of SEQ ID NO: 27;
(9) DNA comprising the nucleotide sequence of SEQ ID NO: 26 and DNA comprising the nucleotide sequence of SEQ ID NO: 28;
(10) DNA comprising the nucleotide sequence of SEQ ID NO: 26 and DNA comprising the nucleotide sequence of SEQ ID NO: 29;
(11) DNA comprising the nucleotide sequence of SEQ ID NO: 34 and DNA comprising the nucleotide sequence of SEQ ID NO: 28;
(12) DNA comprising the nucleotide sequence of SEQ ID NO: 34 and DNA comprising the nucleotide sequence of SEQ ID NO: 29;
(13) DNA comprising the nucleotide sequence of SEQ ID NO: 33 and DNA comprising the nucleotide sequence of SEQ ID NO: 29; or
(14) DNA comprising the nucleotide sequence of SEQ ID NO: 18 and DNA comprising the nucleotide sequence of SEQ ID NO: 19.
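
The pair list above refers to primers only by SEQ ID NO, while the primer list names them p3 to p86. The following is a minimal sketch that cross-references the two lists, assuming the primer names and SEQ ID numbers as read from the text. The dictionary layout and the function names are illustrative and not part of the original disclosure.

```python
# Primer names keyed by SEQ ID NO, as read from the primer list.
PRIMER_BY_SEQ_ID = {
    20: "p3", 21: "p9", 22: "p10", 23: "p12", 24: "p14",
    25: "p16", 26: "p17", 27: "p19", 28: "p26", 29: "p29",
    30: "p36", 31: "p48", 32: "p59", 33: "p85", 34: "p86",
}

# Primer pairs (1) to (14) as (SEQ ID NO, SEQ ID NO), from the pair list.
PRIMER_PAIRS = {
    1: (31, 32), 2: (20, 22), 3: (21, 30), 4: (24, 25),
    5: (23, 27), 6: (23, 28), 7: (23, 29), 8: (26, 27),
    9: (26, 28), 10: (26, 29), 11: (34, 28), 12: (34, 29),
    13: (33, 29), 14: (18, 19),
}


def pair_names(pair_number):
    """Resolve a pair number to primer names; unnamed SEQ IDs stay numeric."""
    return tuple(
        PRIMER_BY_SEQ_ID.get(seq_id, f"SEQ ID NO: {seq_id}")
        for seq_id in PRIMER_PAIRS[pair_number]
    )


def unused_primers():
    """Return the named primers that appear in no pair."""
    used = {seq_id for pair in PRIMER_PAIRS.values() for seq_id in pair}
    return sorted(name for seq_id, name in PRIMER_BY_SEQ_ID.items()
                  if seq_id not in used)


if __name__ == "__main__":
    for number in PRIMER_PAIRS:
        print(number, pair_names(number))
    print("unused:", unused_primers())
```

Pair (14) uses SEQ ID NO: 18 and SEQ ID NO: 19, which are not among the 15 named primers, so the sketch leaves them as SEQ ID numbers.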

[0368] The conditions for PCR were set as follows.

<Reaction solution>

[0369] A total volume of 20.2 µl consisting of DDW (8 µl), 10× buffer (2 µl), dNTP (2.5 mM each, 2 µl), 2 µM primer
1 (2 µl), 2 µM primer 2 (2 µl), genomic DNA isolated from human B lymphoma cells (185 ng/µl), and Taq polymerase
(5 U/µl, 0.2 µl; Ex Taq (TAKARA) or AmpliTaq (Perkin Elmer)).
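
The composition of the reaction solution can be checked with simple dilution arithmetic. The following is a minimal sketch, assuming the component volumes are read as DDW 8 µl, 10× buffer 2 µl, dNTP 2 µl, each primer 2 µl, and Taq polymerase 0.2 µl in a total of 20.2 µl; the readings of the buffer volume and of the concentration units are assumptions. The text gives no volume for the genomic DNA, so the sketch reports only the volume left unaccounted for, which presumably corresponds to the template. That is an inference, not a statement from the source.

```python
from decimal import Decimal

# Total reaction volume and the component volumes read from the text (µl).
TOTAL_UL = Decimal("20.2")
STATED_UL = {
    "DDW": Decimal("8"),
    "10x buffer": Decimal("2"),
    "dNTP (2.5 mM each)": Decimal("2"),
    "primer 1 (2 µM)": Decimal("2"),
    "primer 2 (2 µM)": Decimal("2"),
    "Taq polymerase": Decimal("0.2"),
}


def unaccounted_volume(total, stated):
    """Volume left over once all stated components are subtracted."""
    return total - sum(stated.values(), Decimal("0"))


def final_concentration(stock, volume, total):
    """Dilution arithmetic: stock concentration * volume / total volume."""
    return stock * volume / total


if __name__ == "__main__":
    print(unaccounted_volume(TOTAL_UL, STATED_UL), "µl not itemised")
    primer = final_concentration(Decimal("2"), Decimal("2"), TOTAL_UL)
    dntp = final_concentration(Decimal("2.5"), Decimal("2"), TOTAL_UL)
    print(f"primer: {primer:.3f} µM, each dNTP: {dntp:.3f} mM")
```

`Decimal` is used instead of floating point so that 20.2 minus 16.2 comes out exactly.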

<Reaction> 

[0370] The reaction was conducted according to the following (A) or (B).

(A) 1 cycle (94°C for 30 sec) and 40 cycles (94°C for 10 sec, 54°C for 30 sec,
and 72°C for 3 min and 30 sec), followed by storage at 4°C.

(B) 1 cycle (94°C for 30 sec) and 40 cycles (94°C for 10 sec, 55°C for 30 sec,
and 72°C for 2 min and 10 sec), followed by storage at 4°C.
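
The programmed hold time of each cycling protocol follows directly from the step durations. The following is a minimal sketch, assuming the step times of (A) and (B) as read from the text and ignoring temperature ramp times and the final 4°C hold, which depend on the instrument. The names are illustrative.

```python
# Cycling programs as read from the text; all durations in seconds.
PROGRAMS = {
    "A": {"initial": 30, "cycles": 40, "steps": [10, 30, 3 * 60 + 30]},
    "B": {"initial": 30, "cycles": 40, "steps": [10, 30, 2 * 60 + 10]},
}


def programmed_seconds(name):
    """Sum of hold times: initial step plus cycles * (steps per cycle)."""
    program = PROGRAMS[name]
    return program["initial"] + program["cycles"] * sum(program["steps"])


def as_hms(seconds):
    """Format a duration in seconds as hours, minutes and seconds."""
    hours, remainder = divmod(seconds, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours} h {minutes} min {secs} s"


if __name__ == "__main__":
    for name in PROGRAMS:
        total = programmed_seconds(name)
        print(f"({name}) {total} s = {as_hms(total)}")
```

The longer extension step in (A) accounts for the whole difference between the two programs, which fits its use for the longer target fragments.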

<PCR equipment> 

[0371] A commercial PCR device (Perkin Elmer Thermal Cycler, type 9700) was used.


Example 19: The expression of human AID mRNA in various human organ tissues 

[0372] The expression of human AID mRNA in various human organ tissues was analyzed by RT-PCR according
to the standard method (Immunity, Vol. 9, p. 1-10, 1998).

[0373] RT-PCR was conducted according to the standard method, using the cDNAs of the various tissues included in the
human tissue cDNA panel (CLONTECH) as templates.

[0374] AID cDNA was amplified with primers p17 (SEQ ID NO: 26) and p26 (SEQ ID NO: 28) prepared above
and Taq polymerase.

[0375] As a control, RT-PCR was conducted in the same manner using cDNA of G3PDH as a template with the GF
primer and GR primer (Immunity, Vol. 9, p. 1-10, 1998).

[0376] Figure 24 shows the result. Specific expression of AID mRNA was confirmed in lymph node and tonsil.
This result was consistent with the experimental result in which the expression of mRNA for mouse AID was observed
in the various lymphatic tissues (Examples 8 and 9).

[0377] On the other hand, when the above RT-PCR was conducted in the same manner but with a saturating number of cycles,
the expression of AID mRNA was observed in almost all analyzed organs.

Example 20: Localization of human AID gene on human chromosomes 


[0378] Localization of the human AID gene on human chromosomes was analyzed by the fluorescence in situ hybridization
(FISH) method (Experimental Medicine, Suppl. "Genetic Engineering Hand Book" published by Yodosha, 1992, p.
271-277).

[0379] Genomic DNA including the human AID gene (exon 1 to exon 5), isolated as described above and labeled with
biotin-11-dUTP (Sigma) by the nick translation method, was used as a probe for hybridization.

[0380] The probe was hybridized with chromosomes of human cells in metaphase. Hybridization signals were detected
using fluorescein isothiocyanate-avidin (DCS) (Vector Laboratories).

[0381] Figure 25 shows the result. The human AID gene was found to be localized on chromosome 12p13.
Moreover, this location was revealed to be near 12p13.1, the location of the above-mentioned APOBEC-1, which has
relatively high amino acid sequence homology with the AID protein and the same cytidine deaminase activity as
the AID protein.

[0382] It has been reported that abnormalities at human chromosome loci 12p13.3-12p11.2, 12p13.2-12p24.1,
and 12p13 may be involved in diseases such as acrocallosal syndrome, inflammatory bowel syndrome, and familial periodic
fever, respectively; however, the causative genes have not yet been identified. It has been suggested that the human AID
gene of the present invention may be involved in the onset of such diseases.

Industrial Applicability 

[0383] The AID protein of the present invention can be considered to have a function of regulating various biological
mechanisms required for the generation of antigen-specific immunoglobulins (specific antibodies), which eliminate
non-self antigens (foreign antigens, self-reacting cells, etc.) that trigger various diseases. More specifically, the AID protein
of the present invention can be considered to be one of the enzymes that play an important role in genetic editing,
such as RNA editing, occurring in germinal center B cells in processes such as activation of B cells, class switch
recombination of immunoglobulin genes, somatic hypermutation, and affinity maturation, which are the mechanisms for the
generation of immunoglobulins having high specificity for antigens.

[0384] Dysfunction of the AID protein of the present invention can be a cause of humoral immunodeficiency,
since it induces failure of germinal center B cell functions such as antigen-specific B cell activation, class switch
recombination, and somatic mutation. Conversely, a breakdown in the regulation of the AID protein may induce allergic
disease or autoimmune disease, since it can cause inappropriate B cell activation and needless class switch
recombination and somatic mutation.

[0385] Therefore, regulation of the function of the AID protein and the gene encoding it enables the prevention and treatment of
various immunodeficiencies, autoimmune diseases, and allergies, which result from, for example, B cell dysfunctions
(e.g. IgA deficiency, IgA nephropathy, γ globulinemia, hyper IgM syndrome, etc.) or class switch deficiency of
immunoglobulin. Thus, the AID protein and the gene encoding the AID protein can be targets for the development of drugs
for the therapy of the diseases mentioned above.

[0386] Examples of diseases for which prevention of onset, remission of symptoms, therapy and/or symptomatic treatment
can be expected from regulating the function of the AID protein of the present invention or the gene encoding it include,
for example, primary immunodeficiency syndromes with congenital disorders of the immune system, mainly immunodeficiencies
considered to develop through B cell deficiency, decrease, or dysfunction (e.g. sex-linked agammaglobulinemia,
sex-linked agammaglobulinemia with growth hormone deficiency, immunoglobulin deficiency with high IgM level, selective
IgM deficiency, selective IgE deficiency, immunoglobulin heavy chain gene deletion, κ chain deficiency, IgA deficiency,
IgG subclass selective deficiency, CVID (common variable immunodeficiency), infantile transient dysgammaglobulinemia,
Rosen syndrome, severe combined immunodeficiency (sex-linked, autosomal recessive), ADA (adenosine deaminase)
deficiency, PNP (purine nucleoside phosphorylase) deficiency, MHC class II deficiency, reticular dysplasia,
Wiskott-Aldrich syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal aberration, familial Ig hypermetabolism,
hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, Good syndrome, osteodystrophy, transcobalamin
syndrome, secretary bead syndrome, etc.), various diseases with antibody production deficiency that are secondary
immunodeficiency syndromes with disorders of the immune system caused by an acquired etiology (for example, AIDS, etc.),
and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis, conjunctivitis, allergic rhinitis, allergic
enteritis, drug-induced allergy, food allergy, allergic urticaria, glomerulonephritis, etc.). These could be targets for drug
development.

[0387] Namely, the AID protein of the present invention, a fragment thereof, a DNA encoding the AID protein, a
fragment thereof, and an antibody against the AID protein are useful as reagents for developing drugs for the prevention
and therapy of such diseases.

[0388] Also, the DNA itself is useful as an antisense drug regulating the function of the AID gene at the gene level and
for use in gene therapy. The protein or fragments thereof (e.g. the enzyme active site) are themselves useful as drugs.

[0389] Furthermore, an antibody reactive to the AID protein of the present invention or a fragment thereof is extremely
useful as an antibody drug that regulates the functions of the AID protein.

[0390] Furthermore, the gene (DNA), protein, and antibody of the present invention are useful as reagents for searching
for substrates (e.g. RNA, etc.) interacting (binding) with the protein (enzyme) of the present invention, or other auxiliary
proteins associated with the protein of the present invention, and for developing drugs targeting the substrates and
auxiliary proteins.

[0391] Furthermore, a method for identifying a substance that regulates production of the AID protein of the present
invention or transcription of a gene encoding the AID protein into mRNA is extremely useful as a means to develop
drugs for therapy and prevention of various diseases (especially, immunodeficiency and allergic disease) in which the
above-mentioned AID protein or AID gene is considered to be involved.

SEQUENCE LISTING

<110> Japan Tobacco, Inc.
      Honjo, Tasuku

<120> Novel Cytidine Deaminase

<130> J1-101DP2PCT

<140>
<141>

<150> JP 11-087192
<151> 1999-03-29

<150> JP 11-178999
<151> 1999-06-24

<150> JP 11-371382
<151> 1999-12-27

<160> 35

<170> PatentIn Ver. 2.1



<210> 1
<211> 2440
<212> DNA
<213> Mus musculus

<220>
<221> CDS
<222> (93)..(689)

<220>
<221> 5'UTR
<222> (1)..(92)

<220>
<221> 3'UTR
<222> (690)..(2440)

<400> 1

ggcacgagca gcactgaagc agccttgctt gaagcaagct tcctttggcc taagactttg 60

agggagtcaa gaaagtcacg ctggagaccg at atg gac agc ctt ctg atg aag 113
                                    Met Asp Ser Leu Leu Met Lys
                                    1               5

caa aag aag ttt ctt tac cat ttc aaa aat gtc cgc tgg gcc aag gga 161
Gln Lys Lys Phe Leu Tyr His Phe Lys Asn Val Arg Trp Ala Lys Gly
        10                  15                  20

cgg cat gag acc tac ctc tgc tac gtg gtg aag agg aga gat agt gcc 209
Arg His Glu Thr Tyr Leu Cys Tyr Val Val Lys Arg Arg Asp Ser Ala
    25                  30                  35

acc tcc tgc tca ctg gac ttc ggc cac ctt cgc aac aag tct ggc tgc 257
Thr Ser Cys Ser Leu Asp Phe Gly His Leu Arg Asn Lys Ser Gly Cys
40                  45                  50                  55

cac gtg gaa ttg ttg ttc cta cgc tac atc tca gac tgg gac ctg gac 305
His Val Glu Leu Leu Phe Leu Arg Tyr Ile Ser Asp Trp Asp Leu Asp
                60                  65                  70

ccg ggc cgg tgt tac cgc gtc acc tgg ttc acc tcc tgg agc ccg tgc 353
Pro Gly Arg Cys Tyr Arg Val Thr Trp Phe Thr Ser Trp Ser Pro Cys
            75                  80                  85

tat gac tgt gcc cgg cac gtg gct gag ttt ctg aga tgg aac cct aac 401
Tyr Asp Cys Ala Arg His Val Ala Glu Phe Leu Arg Trp Asn Pro Asn
        90                  95                  100

ctc agc ctg agg att ttc acc gcg cgc ctc tac ttc tgt gaa gac cgc 449
Leu Ser Leu Arg Ile Phe Thr Ala Arg Leu Tyr Phe Cys Glu Asp Arg
    105                 110                 115

aag gct gag cct gag ggg ctg cgg aga ctg cac cgc gct ggg gtc cag 497
Lys Ala Glu Pro Glu Gly Leu Arg Arg Leu His Arg Ala Gly Val Gln
120                 125                 130                 135

atc ggg atc atg acc ttc aaa gac tat ttt tac tgc tgg aat aca ttt 545
Ile Gly Ile Met Thr Phe Lys Asp Tyr Phe Tyr Cys Trp Asn Thr Phe
                140                 145                 150

gta gaa aat cgt gaa aga act ttc aaa gcc tgg gaa ggg cta cat gaa 593
Val Glu Asn Arg Glu Arg Thr Phe Lys Ala Trp Glu Gly Leu His Glu
            155                 160                 165

aat tct gtc cgg cta acc aga caa ctt cgg cgc atc ctt ttg ccc ttg 641
Asn Ser Val Arg Leu Thr Arg Gln Leu Arg Arg Ile Leu Leu Pro Leu
        170                 175                 180

tac gaa gtc gat gac ttg cga gat gca ttt cgt atg ttg gga ttt tga 689
Tyr Glu Val Asp Asp Leu Arg Asp Ala Phe Arg Met Leu Gly Phe
    185                 190                 195

aagcaacctc ctggaatgtc acacgtgatg aaatttctct gaagagactg gatagaaaaa 749 

caacccttca actacatgtt tttcttctta agtactcact tttataagtg tagggggaaa 809

ttatatgact ttttaaaaaa tacttgagct gcacaggacc gccagagcaa tgatgtaact 869

gagcttgctg tgcaacatcg ccatctactg gggaacagca taacttccag actttgggtc 929
gtgaatgatg ctcttttttt tcaacagcat ggaaaagcat atggagacga ccacacagtt 989
tgttacaccc accctgtgtt ccttgattca tttgaattct caggggtatc agtgacggat 1049 
tcttctattc tttccctcta aggctcactt tcaggggtcc ttttctgaca aggtcacggg 1109 
gctgtcctac agtctctgtc tgagcaatca caagccattc tctcaaaaac attaatactc 1169 
aggcacatgc tgtatgtttt cactgtccgt cgtgtttttc acatttgtat gtgaaagggc 1229 
ttggggtggg atttgaagaa tgcacgatcg cctctgggtg atttcaataa aggatcttaa 1289 
aatgcagatg aggactacga agaaatcact ctgaaaatga gttcacgcct caagaagcaa 1349 
atcccctgga aacacagact ctttttcatt tttaatgtca ttagtttact cacagtctta 1409 
tcaagaagaa gagttcaagg gttcaaccca attttcagat cgcgtccctt aaacatcagt 1469 
aattctgtta aagggatcaa acatccttat ttcttaacta actggtgcct tgctgtagag 1529 
aaaggagcaa agcgcccaga tccaaagtat atagttatca tagccaggaa ccgctactcg 1589 
ttttccatta caaatggcaa attcttcccc gggctctcct catagtgcct gagacggacc 1649 
acggaggtga tgaacctccg gattctctgg cccaacacgg tggaagctct gcaagggcgc 1709 
agagacagaa tgcggcagaa attgcccccg agtcccaact ctcctttcct tgcgaccttg 1769



ggaacaagac ttaaaggagc ctgtgactta gaaacttcta gtaatgggta cctgggagtc 1829 



gtttgagtat ggggcagtga tttattctct gtgatggatg ccaacacggt taaacagaat 1889 



ttttagtttt tatatgtgtg tgatgctgct cccccaaatt gttaactgtg taagagggtg 1949 



gcaaaatagg gaaagtggca ttcacctata gttccagcat tcaggaagct gaggcaggag 2009 



gattgtaaat ttgaggccag tctgagctgt aaggtgagac cctatttcaa acaacacagc 2069 



cagaattggg ttctggtaaa tcatacttaa caagggaaaa atgcaagacg caagaccgtg 2129 



gcaaggaaat gacgctttgc ccaacgaaat gtaggaaacc aacatagact cccagtttgt 2189 



ccctctttat gtctggtctc cctaacaacg atctttgcta atgagaaaaa tattagaaaa 2249 



aaatatccct gtgcaattat cacccagtcg ccattataat gcaattaaaa ggcccacaag 2309 



aaatcctgta tacacgaccg ttatttattg tatgtaagtt gctgaggaag aggagaaaaa 2369 



aataaagatc atccattcct tcctgcaaaa aaaaaaaaaa aaanaaaaaa aaaaaaaaaa 2429 



aaaaaaaaaa a 2440 
<210> 2
<211> 198
<212> PRT
<213> Mus musculus

<400> 2

Met Asp Ser Leu Leu Met Lys Gln Lys Lys Phe Leu Tyr His Phe Lys
1               5                   10                  15

Asn Val Arg Trp Ala Lys Gly Arg His Glu Thr Tyr Leu Cys Tyr Val
            20                  25                  30

Val Lys Arg Arg Asp Ser Ala Thr Ser Cys Ser Leu Asp Phe Gly His
        35                  40                  45

Leu Arg Asn Lys Ser Gly Cys His Val Glu Leu Leu Phe Leu Arg Tyr
    50                  55                  60

Ile Ser Asp Trp Asp Leu Asp Pro Gly Arg Cys Tyr Arg Val Thr Trp
65                  70                  75                  80

Phe Thr Ser Trp Ser Pro Cys Tyr Asp Cys Ala Arg His Val Ala Glu
                85                  90                  95

Phe Leu Arg Trp Asn Pro Asn Leu Ser Leu Arg Ile Phe Thr Ala Arg
            100                 105                 110

Leu Tyr Phe Cys Glu Asp Arg Lys Ala Glu Pro Glu Gly Leu Arg Arg
        115                 120                 125

Leu His Arg Ala Gly Val Gln Ile Gly Ile Met Thr Phe Lys Asp Tyr
    130                 135                 140

Phe Tyr Cys Trp Asn Thr Phe Val Glu Asn Arg Glu Arg Thr Phe Lys
145                 150                 155                 160

Ala Trp Glu Gly Leu His Glu Asn Ser Val Arg Leu Thr Arg Gln Leu
                165                 170                 175

Arg Arg Ile Leu Leu Pro Leu Tyr Glu Val Asp Asp Leu Arg Asp Ala
            180                 185                 190

Phe Arg Met Leu Gly Phe
        195



<210> 3
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Artificially
      synthesized primer sequence, AID138

<400> 3
ggaattcgcc atggacagcc ttctgatgaa

<210> 4
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Artificially
      synthesized primer sequence, AID161

<400> 4
gccgctcgag tcaaaatccc aacatacgaa

<210> 5
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Artificially
      synthesized primer sequence, AID118

<400> 5
ggctgaggtt agggttccat ctcag

<210> 6
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Artificially
      synthesized primer sequence, AID119

<400> 6
gagggagtca agaaagtcac gctgg

<210> 7
<211> 2818
<212> DNA
<213> Homo sapiens

<220>
<221> 5'UTR
<222> (1)..(79)

<220>
<221> CDS
<222> (80)..(676)

<220>
<221> 3'UTR
<222> (677)..(2818)

<400> 7

agagaaccat cattaattga agtgagattt ttctggcctg agacttgcag ggaggcaaga 60

agacactctg gacaccact atg gac agc ctc ttg atg aac cgg agg aag ttt 112
                     Met Asp Ser Leu Leu Met Asn Arg Arg Lys Phe
                     1               5                   10

ctt tac caa ttc aaa aat gtc cgc tgg gct aag ggt cgg cgt gag acc 160
Leu Tyr Gln Phe Lys Asn Val Arg Trp Ala Lys Gly Arg Arg Glu Thr
            15                  20                  25

tac ctg tgc tac gta gtg aag agg cgt gac agt gct aca tcc ttt tca 208
Tyr Leu Cys Tyr Val Val Lys Arg Arg Asp Ser Ala Thr Ser Phe Ser
        30                  35                  40

ctg gac ttt ggt tat ctt cgc aat aag aac ggc tgc cac gtg gaa ttg 256
Leu Asp Phe Gly Tyr Leu Arg Asn Lys Asn Gly Cys His Val Glu Leu
    45                  50                  55

ctc ttc ctc cgc tac atc tcg gac tgg gac cta gac cct ggc cgc tgc 304
Leu Phe Leu Arg Tyr Ile Ser Asp Trp Asp Leu Asp Pro Gly Arg Cys
60                  65                  70                  75

tac cgc gtc acc tgg ttc acc tcc tgg agc ccc tgc tac gac tgt gcc 352
Tyr Arg Val Thr Trp Phe Thr Ser Trp Ser Pro Cys Tyr Asp Cys Ala
                80                  85                  90

cga cat gtg gcc gac ttt ctg cga ggg aac ccc aac ctc agt ctg agg 400
Arg His Val Ala Asp Phe Leu Arg Gly Asn Pro Asn Leu Ser Leu Arg
            95                  100                 105

atc ttc acc gcg cgc ctc tac ttc tgt gag gac cgc aag gct gag ccc 448
Ile Phe Thr Ala Arg Leu Tyr Phe Cys Glu Asp Arg Lys Ala Glu Pro
        110                 115                 120

gag ggg ctg cgg cgg ctg cac cgc gcc ggg gtg caa ata gcc atc atg 496
Glu Gly Leu Arg Arg Leu His Arg Ala Gly Val Gln Ile Ala Ile Met
    125                 130                 135

acc ttc aaa gat tat ttt tac tgc tgg aat act ttt gta gaa aac cat 544
Thr Phe Lys Asp Tyr Phe Tyr Cys Trp Asn Thr Phe Val Glu Asn His
140                 145                 150                 155

gaa aga act ttc aaa gcc tgg gaa ggg ctg cat gaa aat tca gtt cgt 592
Glu Arg Thr Phe Lys Ala Trp Glu Gly Leu His Glu Asn Ser Val Arg
                160                 165                 170

ctc tcc aga cag ctt cgg cgc atc ctt ttg ccc ctg tat gag gtt gat 640
Leu Ser Arg Gln Leu Arg Arg Ile Leu Leu Pro Leu Tyr Glu Val Asp
            175                 180                 185

gac tta cga gac gca ttt cgt act ttg gga ctt tga tagcaacttc 686
Asp Leu Arg Asp Ala Phe Arg Thr Leu Gly Leu
        190                 195



caggaatgtc acacacgatg aaatatctct gctgaagaca gtggataaaa aacagtcctt 746 



caagtcttct ctgtttttat tcttcaactc tcactttctt agagtttaca gaaaaaatat 806 



ttatatacga ctctttaaaa agatctatgt cttgaaaata gagaaggaac acaggtctgg 866 



ccagggacgt gctgcaattg gtgcagtttt gaatgcaaca ttgtccccta ctgggaataa 926



cagaactgca ggacctggga gcatcctaaa gtgtcaacgt ttttctatga cttttaggta 986 



ggatgagagc agaaggtaga tcctaaaaag catggtgaga ggatcaaatg tttttatatc 1046 



aacatccttt attatttgat tcatttgagt taacagtggt gttagtgata gatttttcta 1106 



ttcttttccc ttgacgttta ctttcaagta acacaaactc ttccatcagg ccatgatcta 1166 



taggacctcc taatgagagt atctgggtga ttgtgacccc aaaccatctc tccaaagcat 1226 



taatatccaa tcatgcgctg tatgttttaa tcagcagaag catgttttta tgtttgtaca 1286 



aaagaagatt gttatgggtg gggatggagg tatagaccat gcatggtcac cttcaagcta 1346 
ctttaataaa ggatcttaaa atgggcagga ggactgtgaa caagacaccc taataatggg 1406
ttgatgtctg aagtagcaaa tcttctggaa acgcaaactc ttttaaggaa gtccctaatt 1466 
tagaaacacc cacaaacttc acatatcata attagcaaac aattggaagg aagttgcttg 1526 
aatgttgggg agaggaaaat ctattggctc tcgtgggtct cttcatctca gaaatgccaa 1586 
tcaggtcaag gtttgctaca ttttgtatgt gtgtgatgct tctcccaaag gtatattaac 1646 
tatataagag agttgtgaca aaacagaatg ataaagctgc gaaccgtggc acacgctcat 1706 
agttctagct gcttgggagg ttgaggaggg aggatggctt gaacacaggt gttcaaggcc 1766 
agcctgggca acataacaag atcctgtctc tcaaaaaaaa aaaaaaaaaa aagaaagaga 1826 
gagggccggg cgtggtggct cacgcctgta atcccagcac tttgggaggc cgagccgggc 1886 
ggatcacctg tggtcaggag tttgagacca gcctggccaa catggcaaaa ccccgtctgt 1946 
actcaaaatg caaaaattag ccaggcgtgg tagcaggcac ctgtaatccc agctacttgg 2006 
gaggctgagg caggagaatc gcttgaaccc aggaggtgga ggttgcagta agctgagatc 2066 
gtgccgttgc actccagcct gggcgacaag agcaagactc tgtctcagaa aaaaaaaaaa 2126 
aaaagagaga gagagagaaa gagaacaata tttgggagag aaggatgggg aagcattgca 2186
aggaaattgt gctttatcca acaaaatgta aggagccaat aagggatccc tatttgtctc 2246 
ttttggtgtc tatttgtccc taacaactgt ctttgacagt gagaaaaata ttcagaataa 2306 
ccatatccct gtgccgttat tacctagcaa cccttgcaat gaagatgagc agatccacag 2366
gaaaacttga atgcacaact gtcttatttt aatcttattg tacataagtt tgtaaaagag 2426 
ttaaaaattg ttacttcatg tattcattta tattttatat tattttgcgt ctaatgattt 2486 
tttattaaca tgatttcctt ttctgatata ttgaaatgga gtctcaaagc ttcataaatt 2546 
tataacttta gaaatgattc taataacaac gtatgtaatt gtaacattgc agtaatggtg 2606 
ctacgaagcc atttctcttg atttttagta aacttttatg acagcaaatt tgcttctggc 2666 
tcactttcaa tcagttaaat aaatgataaa taattttgga agctgtgaag ataaaatacc 2726 
aaataaaata atataaaagt gatttatatg aagtt^aat aaaaaatcag tatgatggaa 2786 
taaacttgaa aaaaaaaaaa aaaaaaaaaa aa 2818 
<210> 8
<211> 198
<212> PRT
<213> Homo sapiens

<400> 8

Met Asp Ser Leu Leu Met Asn Arg Arg Lys Phe Leu Tyr Gln Phe Lys
1               5                   10                  15

Asn Val Arg Trp Ala Lys Gly Arg Arg Glu Thr Tyr Leu Cys Tyr Val
            20                  25                  30

Val Lys Arg Arg Asp Ser Ala Thr Ser Phe Ser Leu Asp Phe Gly Tyr
        35                  40                  45

Leu Arg Asn Lys Asn Gly Cys His Val Glu Leu Leu Phe Leu Arg Tyr
    50                  55                  60

Ile Ser Asp Trp Asp Leu Asp Pro Gly Arg Cys Tyr Arg Val Thr Trp
65                  70                  75                  80

Phe Thr Ser Trp Ser Pro Cys Tyr Asp Cys Ala Arg His Val Ala Asp
                85                  90                  95

Phe Leu Arg Gly Asn Pro Asn Leu Ser Leu Arg Ile Phe Thr Ala Arg
            100                 105                 110

Leu Tyr Phe Cys Glu Asp Arg Lys Ala Glu Pro Glu Gly Leu Arg Arg
        115                 120                 125

Leu His Arg Ala Gly Val Gln Ile Ala Ile Met Thr Phe Lys Asp Tyr
    130                 135                 140

Phe Tyr Cys Trp Asn Thr Phe Val Glu Asn His Glu Arg Thr Phe Lys
145                 150                 155                 160

Ala Trp Glu Gly Leu His Glu Asn Ser Val Arg Leu Ser Arg Gln Leu
                165                 170                 175

Arg Arg Ile Leu Leu Pro Leu Tyr Glu Val Asp Asp Leu Arg Asp Ala
            180                 185                 190

Phe Arg Thr Leu Gly Leu
        195



<210> 9
<211> 5514
<212> DNA
<213> Homo sapiens

<220>
<221> intron
<222> (1)..(1031)

<220>
<221> exon
<222> (1032)..(1118)

<220>
<221> intron
<222> (1119)..(5514)

<400> 9

acagaccaat acatggtcca agctagggct attgatttga aaatcatcaa ggtatagatg 60
gtatcaaagg cttgaggcag gaagagagca gagaccctag ctgcattgct tagcattgca 120
tccctagcac ctggcatagt ttccattaac agtaggcatg aagtatctac tcagtgaata 180
aatagaatgc atatgggcta cagtaggaga gagaaataaa atctttaata gaccaagttc 240
tatgagagca caaaattaaa gtcttttatt tgaagatctt agcctgtttt ccaaattcag 300
tgcagccagt tagacactga ttctgtctgg tgaaacaagc atttttgtat tttgggggac 360
tgctgctgct tctgactcca aattaaggat tttttttttt tctaaaaaag atggctcatg 420
caaaaatcac tctttggtgt aaatatctag tcttcaagca attcttgtaa tgcaatcaga 480
aagaaaaaaa tccatggttt gggaggcaaa atttttgtgt tctaaattct atataactga 540
gttcatttgc ttaactgcaa agcaggagct gctagtgcct gtctgtactg aggttcagag 600
agactgtggg aatatggggg aattagaggc tatctgaggc tcttcaacac aataacccaa 660
gaagctattt aaatgctctt taaggtattt acataaatat tactattctc attgtgcttt 720
tattttgtgt tatcatgatt ataattgaag tgtctactgt tactgcctcc tgatctttgc 780
tagctatgga gcatggactg ggcttttaga gcagcagccc caaaggaacc taaacattaa 840
agcagagctg ccctcaatgg tttaacctgt gtgactctgc ctatgacagc cccacccacc 900
catcttcact ggatccaaat caggagcaag gccgttgggg tacctggtgg gggtgatgct 960
gtcaggggag gagcccaaaa gggcaagctc aaatttgaat gtgaagggcc aatgcactgt 1020
cagactgaga cagagaacca tcattaattg aagtgagatt tttctggcct gagacttgca 1080
gggaggcaag aagacactct ggacaccact atggacaggt aaagaggcag tcttctcgtg 1140
ggtgattgca ctggccttcc tctcagagca aatctgagta atgagactgg tagctatccc 1200
tttctctcat gtaactgtct gactgataag atcagcttga tcaatatgca tatatatttt 1260
ttgatctgtc tccttttctt ctattcagat cttatacgct gtcagcccaa ttctttctgt 1320
ttcagacttc tcttgatttc cctctttttc atgtggcaaa agaagtagtg cgtacaatgt 1380
actgattcgt cctgagattt gtaccatggt tgaaactaat ttatggtaat aatattaaca 1440
tagcaaatct ttagagactc aaatcatgaa aaggtaatag cagtactgta ctaaaaacgg 1500
tagtgctaat ttt&gtaata attttgtaaa tattcaacag taaaacaact tgaagacaca 1560
ctttcctagg gaggcgttac tgaaataatt tagctatagt aagaaaattt gtaattttag 1620 
aaatgccaag cattctaaat taattgcttg aaagtcacta tgattgtgtc cattataagg 1680 
agacaaattc attcaagcaa gttatttaat gttaaaggcc caattgttag gcagttaatg 1740 
gcacttttac tattaactaa tctttccatt tgttcagacg tagcttaact tacctcttag 1800
gtgtgaattt ggttaaggtc ctcataatgt ctttatgtgc agtttttgat aggttattgt 1860 
catagaactt attctattcc tacatttatg attactatgg atgtatgaga ataacaccta 1920 
atccttatac tttacctcaa tttaactcct ttataaagaa cttacattac agaataaaga 1980
ttttttaaaa atatattttt ttgtagagac agggtcttag cccagccgag gctggtctct 2040 
aagtcctggc ccaagcgatc ctcctgcctg ggcctcctaa agtgctggaa ttatagacat 2100 
gagccatcac atccaatata cagaataaag atttttaatg gaggatttaa tgttcttcag 2160 
aaaattttct tgaggtcaga caatgtcaaa tgtctcctca gtttacactg agattttgaa 2220 
aacaagtctg agctataggt ccttgtgaag ggtccattgg aaatacttgt tcaaagtaaa 2280 
atggaaagca aaggtaaaat cagcagttga aattcagaga aagacagaaa aggagaaaag 2340 
atgaaattca acaggacaga agggaaatat attatcatta aggaggacag tatctgtaga 2400 
gctcattagt gatggcaaaa tgacttggtc aggattattt ttaacccgct tgtttctggt 2460 
ttgcacggct ggggatgcag ctagggttct gcctcaggga gcacagctgt ccagagcagc 2520 
tgtcagcctg caagcctgaa acactccctc ggtaaagtcc ttcctactca ggacagaaat 2580 
gacgagaaca gggagctgga aacaggcccc taaccagaga agggaagtaa tggatcaaca 2640 
aagttaacta gcaggtcagg atcacgcaat tcatttcact ctgactggta acatgtgaca 2700
gaaacagtgt aggcttattg tattttcatg tagagtagga cccaaaaatc cacccaaagt 2760 
cctttatcta tgccacatcc ttcttatcta tacttccagg acactttttc ttccttatga 2820 
taaggctctc tctctctcca cacacacaca cacacacaca cacacacaca cacacacaca 2880 
cacaaacaca caccccgcca accaaggtgc iatgtaaaaag atgtagattc ctctgccttt 2940 
ctcatctaca cagcccagga gggtaagtta atataagagg gatttattgg taagagatga 3000 
tgcttaatct gtttaacact gggcctcaaa gagagaattt cttttcttct gtacttatta 3060 
agcacctatt atgtgttgag cttatatata caaagggtta ttatatgcta atatagtaat 3120
agtaatgktg gttggtacta tggtaattac cataaaaatt avtatccttt taaaataaag 3180
ctaattatta ttggatcttt tttagtattc attttatgtt ttttatgttt ttgatttttt 3240
aaaagacaat ctcaccctgt tacccaggct ggagtgcagt ggtgcaatca tagctttctg 3300
cagtcttgaa ctcctgggct caagcaatcc tcctgccttg gcctcccaaa gtgttgggat 3360
acagtcatga gccactgcat ctggcctagg atccatttag attaaaatat gcattttaaa 3420
ttttaaaata atatggctaa tttttacctt atgtaatgtg tatactggta ataaatctag 3480
tttgctgcct aaagtttaaa gtgctttcca ataagcttca tgtacgtgag gggagacatt 3540
taaagtgaaa cagacagcca ggtgtggtgg ctcacgcctg taatcccagc actctgggag 3600
gctgaggtgg gtggatcgct tgagccctgg agttcaagac cagcctgagc aacatggcaa 3660
aaccctgttt ctataacaaa aattagccgg gcatggtggc atgtgcctgt ggtcccagct 3720
actagggggc tgaggcagga gaatctttgg agcccaggag gtcaaggctg cactgagcag 3780
tgcttgcgcc actgcactcc agcctgggtg acaggaccag accttgcctc aaaaaaataa 3840
gaagaaaaat taaaaataaa tggaaacaac tacaaagagc tgttgtccta gatgagctac 3900
ttagttaggc tgatattttg gtatttaact tttaaagtca gggtctgtca cctgcactac 3960
attattaaaa tatcaattct caatgtatat ccacacaasg actggtacgt gaatgttcat 4020
agtaccttta ttcacaaaac cccaaagtag agactatcca aatatccatc aacaagtgaa 4080
caaataaaca aaatgtgcta tatccatgca atggaatacc accctgcagt acaaaggaag 4140
aagctacttg gggatgaatc ccaaagtcat gacgctaaat gaaagagtca gacatgaagg 4200
aggagataat gtatgccata cgaaattcta gaaaatgaaa gtaacttata gttacagaaa 4260
gcaaatcagg gcaggcatag aggctcacac ctgtaatccc agcactttga gaggccacgt 4320
Re^aacatts ctagaactca ^cafTttcaae accagcctgg gcaacacagt gaaactccat 4380
tctccacaaa aatgggaaaa aaagaaagca aatcagtggt tgtcctgtgg ggaggggaag 4440
gactgcaaag agggaagaag ctctggtggg gtgagggtgg tgattcaggt tctgtatcct 4500
gactgtggta gcagtttggg ccaaaaatat tcgtagaatt atgcatctta 4560
gtgtttacat aatgggtgga gtttactgta acctcaatgt aagaaaaaat aatgtgtaag 4620



EP1 174 509 A1 



aaaagtttca attctcttgc cagcaaacgt tattcaaatt cctgagccct ttacttcgca 4680 
aattctctgc acttctgccc cgtaccatta ggtgacagca ctagctccac aaattggata 4740 
aatgcatttc tggaaaagac tagggacaaa atccaggcat cacttgtgct ttcatatcaa 4800 
ccacgctgta cagcttgtgt tgctgtctgc agctgcaatg gggactcttg atttctttaa 4860 
ggaaacttgg gttaccagag tatttccaca aatgctattc aaattagtgc ttatgatatg 4920 
caagacactg tgctaggagc cagaaaacaa agaggaggag aaatcagtca ttatgtggga 4980 
acaacatagc aagatattta gatcattttg actagttaaa aaagcagcag agtacaaaat 5040 
cacacatgca atcagtataa tccaaatcat gtaaatatgt gcctgtagaa agactagagg 5100 
aataaacaca agaatcttaa cagtcattgt cattagacac taagtctaat tattattatt 5160 
agacactatg atatttgaga tttaaaaaat ctttaatatt ttaaaattta gagctcttct 5220 
atttttccat agtattcaag tttgacaatg atcaagtatt actctttctt tttttttttt 5280 
tttttttttt tttgagatgg agttttggtc ttgttgccca tgctggagtg gaatggcatg 5340 
accatagctc actgcaacct ccacctcctg ggttcaagca aagctgtcgc ctcagcctcc 5400 
cgggtagatg ggattacagg cgcccaccac cacactcggc taatgtttgt atttttagta 5460 
gagatggggt ttcaccatgt tggccaggct ggtctcaaac tcctgacctc agag 5514 



<210> 10 
<211> 6564 
<212> DNA 
<213> Homo sapiens 

<400> 10 

gggggcctgt aatcccagct actcaggagg ctgaggcagg aggatccgcg gagcctggca 60 
gatctgcctg agcctgggag gttgaggcta cagtaagcca agatcatgcc agtatacttc 120 
agcctgggcg acaaagtgag accgtaacaa aaaaaaaaaa atttaaaaaa agaaatttag 180 
atcaagatcc aactgtaaaa agtggcctaa acaccacatt aaagagtttg gagtttattc 240 
tgcaggcaga agagaaccat cagggggtct tcagcatggg aatggcatgg tgcacctggt 300 
ttttgtgaga tcatggtggt gacagtgtgg ggaatgttat tttggaggga ctggaggcag 360 
acagaccggt taaaaggcca gcacaacaga taaggaggaa gaagatgagg gcttggaccg 420 
aagcagagaa gagcaaacag ggaaggtaca aattcaagaa atattggggg gtttgaatca 480 
acacatttag atgattaatt aaatatgagg actgaggaat aagaaatgag tcaaggatgg 540 
ttccaggctg ctaggctgct tacctgaggt ggcaaagtcg ggaggagtgg cagtttagga 600 
cagggggcag ttgaggaata ttgttttgat cattttgagt ttgaggtaca agttggacac 660 
ttaggtaaag actggagggg aaatctgaat atacaattat gggactgagg aacaagttta 720 
ttttattttt tgtttcgttt tcttgttgaa gaacaaattt aattgtaatc ccaagtcatc 780 
agcatctaga agacagtggc aggaggtgac tgtcttgtgg gtaagggttt ggggtccttg 840 
atgagtatct ctcaattggc cttaaatata agcaggaaaa ggagtttatg atggattcca 900 
ggctcagcag ggctcaggag ggctcaggca gccagcagag gaagtcagag catcttcttt 960 
ggtttagccc aagtaatgac ttccttaaaa agctgaagga aaatccagag tgaccagatt 1020 
ataaactgta ctcttgcatt ttctctccct cctctcaccc acagcctctt gatgaaccgg 1080 
aggaagtttc tttaccaatt caaaaatgtc cgctgggcta agggtcggcg tgagacctac 1140 
ctgtgctacg tagtgaagag gcgtgacagt gctacatcct tttcactgga ctttggttat 1200 
cttcgcaata aggtatcaat taaagtcagc tttgcaagca gtttaatggt caactgtgag 1260 
tgcttttaga gccacctgct gatggtatta cttccatcct tttttggcat ttgtgtctct 1320 
atcacattcc tcaaatcctt ttttttattt ctttttccat gtccatgcac ccatattaga 1380 
catggcccaa aatatgtgat ttaattcctc cccagtaatg ctgggcaccc taataccact 1440 
ccttccttca gtgccaagaa caactgctcc caaactgttt accagctttc ctcagcatct 1500 
gaattgcctt tgagattaat taagctaaaa gcatttttat atgggagaat attatcagct 1560 
tgtccaagca aaaattttaa atgtgaaaaa caaattgtgt cttaagcatt tttgaaaatt 1620 
aaggaagaag aatttgggaa aaaattaacg gtggttcaat tctgttttcc aaatgatttc 1680 
ttttccctcc tactcacatg ggtcgtaggc cagtgaatac attcaacatg gtgatcccca 1740 
gaaaactcag agaagcctcg gctgatgatt aattaaattg atctttcggc tacccgagag 1800 
aattacattt ccaagagact tcttcaccaa aatccagatg ggtttacata aacttctgcc 1860 
catgggtatc tcctctctcc taacacgctg tgacgtctgg gcttggtgga atctcaggga 1920 
agcatccgtg gggtggaagg tcatcgtctg gctcgttgtt tgatggttat attaccatgc 1980 
aattttcttt gcctacattt gtattgaata catcccaatc tccttcctat tcggtgacat 2040 
gacacattct atttcagaag gctttgattt tatcaagcac tttcatttac ttctcatggc 2100 
agtgcctatt acttctctta caatacccat ctgtctgctt taccaaaatc tatttcccct 2160 
tttcagatcc tcccaaatgg tcctcataaa ctgtcctgcc tccacctagt ggtccaggta 2220 
tatttccaca atgttacatc aacaggcact tctagccatt ttccttctca aaaggtgcaa 2280 
aaagcaactt cataaacaca aattaaatct tcggtgaggt agtgtgatgc tgcttcctcc 2340 
caactcagcg cacttcgtct tcctcattcc acaaaaaccc atagccttcc ttcactctgc 2400 
aggactagtg ctgccaaggg ttcagctcta cctactggtg tgctcttttg agcaagttgc 2460 
ttagcctctc tgtaacacaa ggacaatagc tgcaagcatc cccaaagatc attgcaggag 2520 
acaatgacta aggctaccag agccgcaata aaagtcagtg aattttagcg tggtcctctc 2580 
tgtctctcca gaacggctgc cacgtggaat tgctcttcct ccgctacatc tcggactggg 2640 
acctagaccc tggccgctgc taccgcgtca cctggttcac ctcctggagc ccctgctacg 2700 
actgtgcccg acatgtggcc gactttctgc gagggaaccc caacctcagt ctgaggatct 2760 
tcaccgcgcg cctctacttc tgtgaggacc gcaaggctga gcccgagggg ctgcggcggc 2820 
tgcaccgcgc cggggtgcaa atagccatca tgaccttcaa aggtgcgaaa gggccttccg 2880 
cgcaggcgca gtgcagcagc ccgcattcgg gattgcgatg cggaatgaat gagttagtgg 2940 
ggaagctcga ggggaagaag tgggcgggga ttctggttca cctctggagc cgaaattaaa 3000 
gattagaagc agagaaaaga gtgaatggct cagagacaag gccccgagga aatgagaaaa 3060 
tggggccagg gttgcttctt tcccctcgat ttggaacctg aactgtcttc tacccccata 3120 
tccccgcctt tttttccttt tttttttttt tgaagattat ttttactgct ggaatacttt 3180 
tgtagaaaac cacgaaagaa ctttcaaagc ctgggaaggg ctgcatgaaa attcagttcg 3240 
tctctccaga cagcttcggc gcatcctttt ggtaaggggc ttcctcgctt tttaaatttt 3300 
ctttctttct atacagtctt ttttggagtt tcgtatattt cttatatttt cttattgttc 3360 
aatcactctc agttttcatc tgatgaaaac tttatttctc ctccacatca gctttttctt 3420 
ctgctgtttc accattcaga gccctctgct aaggttcctt ttccctccct tttctttctt 3480 
ttgttgtttc acatctttaa atttctgtct ctccccaggg ttgcgtttcc ttcctggtca 3540 
gaattctttt ctcctttttt tttttttttt tttttttttt taaacaaaca aacaaaaaac 3600 
ccaaaaaaac tctttcccaa tttactttct tccaacatgt tacaaagcca tccactcagt 3660 
ttagaagact ctccggcccc accgaccccc aacctcgttt tgaagccatt cactcaattt 3720 
gcttctctct ttctctacag cccctgtatg aggttgatga cttacgagac gcatttcgta 3780 
ctttgggact ttgatagcaa cttccaggaa tgtcacacac gatgaaatat ctctgctgaa 3840 
gacagtggat aaaaaacagt ccttcaagtc ttctctgttt ttattcttca actctcactt 3900 
tcttagagtt tacagaaaaa atatttatat acgactcttt aaaaagatct atgtcttgaa 3960 
aatagagaag gaacacaggt ctggccaggg acgtgctgca attggtgcag ttttgaatgc 4020 
aacattgtcc cctactggga ataacagaac tgcaggacct gggagcatcc taaagtgtca 4080 
acgtttttct atgactttta ggtaggatga gagcagaagg tagatcctaa aaagcatggt 4140 
gagaggatca aatgttttta tatcaacatc ctttattatt tgattcattt gagttaacag 4200 
tggtgttagt gatagatttt tctattcttt tcccttgacg tttactttca agtaacacaa 4260 
actcttccat caggccatga tctataggac ctcctaatga gagtatctgg gtgattgtga 4320 
ccccaaacca tctctccaaa gcattaatat ccaatcatgc gctgtatgtt ttaatcagca 4380 
gaagcatgtt tttatgtttg tacaaaagaa gattgttatg ggtggggatg gaggtataga 4440 
ccatgcatgg tcaccttcaa gctactttaa taaaggatct taaaatgggc aggaggactg 4500 
tgaacaagac accctaataa tgggttgatg tctgaagtag caaatcttct ggaaacgcaa 4560 
actcttttaa ggaagtccct aatttagaaa cacccacaaa cttcacatat cataattagc 4620 
aaacaattgg aaggaagttg cttgaatgtt ggggagagga aaatctattg gctctcgtgg 4680 
gtctcttcat ctcagaaatg ccaatcaggt caaggtttgc tacattttgt atgtgtgtga 4740 
tgcttctccc aaaggtatat taactatata agagagttgt gacaaaacag aatgataaag 4800 
ctgcgaaccg tggcacacgc tcatagttct agctgcttgg gaggttgagg agggaggatg 4860 






gcttgaacac aggtgttcaa ggccagcctg ggcaacataa caagatcctg tctctcaaaa 4920 
aaaaaaaaaa aaaaaagaaa gagagagggc cgggcgtggt ggctcacgcc tgtaatccca 4980 
gcactttggg aggccgagcc gggcggatca cctgtggtca ggagtttgag accagcctgg 5040 
ccaacatggc aaaaccccgt ctgtactcaa aatgcaaaaa ttagccaggc gtggtagcag 5100 
gcacctgtaa tcccagctac ttgggaggct gaggcaggag aatcgcttga acccaggagg 5160 
tggaggttgc agtaagctga gatcgtgccg ttgcactcca gcctgggcga caagagcaag 5220 
actctgtctc agaaaaaaaa aaaaaaaaga gagagagaga gaaagagaac aatatttggg 5280 
agagaaggat ggggaagcat tgcaaggaaa ttgtgcttta tccaacaaaa tgtaaggagc 5340 
caataaggga tccctatttg tctcttttgg tgtctatttg tccctaacaa ctgtctttga 5400 
cagtgagaaa aatattcaga ataaccatat ccctgtgccg ttattaccta gcaacccttg 5460 
caatgaagat gagcagatcc acaggaaaac ttgaatgcac aactgtctta ttttaatctt 5520 
attgtacata agtttgtaaa agagttaaaa attgttactt catgtattca tttatatttt 5580 
atattatttt gcgtctaatg attttttatt aacatgattt ccttttctga tatattgaaa 5640 
tggagtctca aagcttcata aatttataac tttagaaatg attctaataa caacgtatgt 5700 
aattgtaaca ttgcagtaat ggtgctacga agccatttct cttgattttt agtaaacttt 5760 
tatgacagca aatttgcttc tggctcactt tcaatcagtt aaataaatga taaataattt 5820 
tggaagctgt gaagataaaa taccaaataa aataatataa aagtgattta tatgaagtta 5880 
aaataaaaaa tcagtatgat ggaataaact tgagagtcca gaagttatcc catacatctg 5940 
taatcaacta atttctcaca agggtgtaag gaccattcaa tggagaaaaa atgatcttct 6000 
caacaaatgg tgctgagcta attggatatt acatgcaaag gaatgaattt gagtctctac 6060 
tacacaccat atataaaaat taattaaaaa ttcatcaaat acctaaatat tagagactaa 6120 
tttataaacc gtagagagaa acataggtaa aaatgtttat ggctttagat taggcaacag 6180 
cttcttaatt atgacatcaa aagcacaagc aaccaaagac aaaaataaat cagttggact 6240 
tcatcgaaat taaaaatctt tgtgcatcaa aggacactta gtaagaaagt gaaaagacaa 6300 
cccacagaag tgggagaaaa cacttgcaaa tcatatatct gataagggtt gtgatattat 6360 
gatatatata taggtttttg tccatagttc ctggcttata aaccccctca cccttgttac 6420 
agtcatttgt tataaggttg gatggtttag gcctcagaag caaaactctc tctctcacct 6480 
tctccagccc tcctgtctct ggcacctcat tcttccctga ggccacatag aaactagaat 6540 
ctctcttcca caaggcggtc aaag 6564 



<210> 11 

<211> 87 

<212> DNA 

<213> Homo sapiens 

<400> 11 

agagaaccat cattaattga agtgagattt ttctggcctg agacttgcag ggaggcaaga 60 
agacactctg gacaccacta tggacag 87 



<210> 12 
<211> 148 
<212> DNA 
<213> Homo sapiens 

<400> 12 

cctcttgatg aaccggagga agtttcttta ccaattcaaa aatgtccgct gggctaaggg 60 
tcggcgtgag acctacctgt gctacgtagt gaagaggcgt gacagtgcta catccttttc 120 
actggacttt ggttatcttc gcaataag 148 

<210> 13 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<400> 13 

aacggctgcc acgtggaatt gctcttcctc cgctacatct cggactggga cctagaccct 60 
ggccgctgct accgcgtcac ctggttcacc tcctggagcc cctgctacga ctgtgcccga 120 
catgtggccg actttctgcg agggaacccc aacctcagtc tgaggatctt caccgcgcgc 180 
ctctacttct gtgaggaccg caaggctgag cccgaggggc tgcggcggct gcaccgcgcc 240 
ggggtgcaaa tagccatcat gaccttcaaa g 271 

<210> 14 
<211> 116 
<212> DNA 
<213> Homo sapiens 

<400> 14 

attattttta ctgctggaat acttttgtag aaaaccacga aagaactttc aaagcctggg 60 
aagggctgca tgaaaattca gttcgtctct ccagacagct tcggcgcatc cttttg 116 

<210> 15 

<211> 2172 
<212> DNA 
<213> Homo sapiens 

<400> 15 

cccctgtatg aggttgatga cttacgagac gcatttcgta ctttgggact ttgatagcaa 60 
cttccaggaa tgtcacacac gatgaaatat ctctgctgaa gacagtggat aaaaaacagt 120 
ccttcaagtc ttctctgttt ttattcttca actctcactt tcttagagtt tacagaaaaa 180 
atatttatat acgactcttt aaaaagatct atgtcttgaa aatagagaag gaacacaggt 240 
ctggccaggg acgtgctgca attggtgcag ttttgaatgc aacattgtcc cctactggga 300 
ataacagaac tgcaggacct gggagcatcc taaagtgtca acgtttttct atgactttta 360 
ggtaggatga gagcagaagg tagatcctaa aaagcatggt gagaggatca aatgttttta 420 
tatcaacatc ctttattatt tgattcattt gagttaacag tggtgttagt gatagatttt 480 
tctattcttt tcccttgacg tttactttca agtaacacaa actcttccat caggccatga 540 
tctataggac ctcctaatga gagtatctgg gtgattgtga ccccaaacca tctctccaaa 600 
gcattaatat ccaatcatgc gctgtatgtt ttaatcagca gaagcatgtt tttatgtttg 660 
tacaaaagaa gattgttatg ggtggggatg gaggtataga ccatgcatgg tcaccttcaa 720 
gctactttaa taaaggatct taaaatgggc aggaggactg tgaacaagac accctaataa 780 
tgggttgatg tctgaagtag caaatcttct ggaaacgcaa actcttttaa ggaagtccct 840 
aatttagaaa cacccacaaa cttcacatat cataattagc aaacaattgg aaggaagttg 900 
cttgaatgtt ggggagagga aaatctattg gctctcgtgg gtctcttcat ctcagaaatg 960 
ccaatcaggt caaggtttgc tacattttgt atgtgtgtga tgcttctccc aaaggtatat 1020 
taactatata agagagttgt gacaaaacag aatgataaag ctgcgaaccg tggcacacgc 1080 
tcatagttct agctgcttgg gaggttgagg agggaggatg gcttgaacac aggtgttcaa 1140 
ggccagcctg ggcaacataa caagatcctg tctctcaaaa aaaaaaaaaa aaaaaagaaa 1200 
gagagagggc cgggcgtggt ggctcacgcc tgtaatccca gcactttggg aggccgagcc 1260 
gggcggatca cctgtggtca ggagtttgag accagcctgg ccaacatggc aaaaccccgt 1320 
ctgtactcaa aatgcaaaaa ttagccaggc gtggtagcag gcacctgtaa tcccagctac 1380 
ttgggaggct gaggcaggag aatcgcttga acccaggagg tggaggttgc agtaagctga 1440 
gatcgtgccg ttgcactcca gcctgggcga caagagcaag actctgtctc agaaaaaaaa 1500 
aaaaaaaaga gagagagaga gaaagagaac aatatttggg agagaaggat ggggaagcat 1560 
tgcaaggaaa ttgtgcttta tccaacaaaa tgtaaggagc caataaggga tccctatttg 1620 
tctcttttgg tgtctatttg tccctaacaa ctgtctttga cagtgagaaa aatattcaga 1680 
ataaccatat ccctgtgccg ttattaccta gcaacccttg caatgaagat gagcagatcc 1740 
acaggaaaac ttgaatgcac aactgtctta ttttaatctt attgtacata agtttgtaaa 1800 
agagttaaaa attgttactt catgtattca tttatatttt atattatttt gcgtctaatg 1860 
attttttatt aacatgattt ccttttctga tatattgaaa tggagtctca aagcttcata 1920 
aatttataac tttagaaatg attctaataa caacgtatgt aattgtaaca ttgcagtaat 1980 
ggtgctacga agccatttct cttgattttt agtaaacttt tatgacagca aatttgcttc 2040 
tggctcactt tcaatcagtt aaataaatga taaataattt tggaagctgt gaagataaaa 2100 
taccaaataa aataatataa aagtgattta tatgaagtta aaataaaaaa tcagtatgat 2160 
ggaataaact tg 2172 

<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, 170 

<400> 16 

gagaccgata tggacagcct tctga 

<210> 17 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence:Artificially 
synthesized primer sequence, 181 

<400> 17 

tcacgtgtga cattccagga ggttgct 

<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, 22 

<400> 18 

gtagtgaaga ggcgtgacag tgctacatcc 

<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, 25 

<400> 19 

gttccctcgc agaaagtcgg ccacatg 

<210> 20 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p3 

<400> 20 

gagtttgagg tacaagttgg acac 

<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p9 

<400> 21 

tatctcctct ctcctaacac gct 

<210> 22 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p10 

<400> 22 

acaagctgat aatattctcc cat 

<210> 23 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p12 

<400> 23 

tcttcggtga ggtagtgtga tg 

<210> 24 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p14 

<400> 24 

agcctcttga tgaaccggag gaagtttctt 

<210> 25 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p16 

<400> 25 

ttattgcgaa gataaccaaa gtccagtg 

<210> 26 
<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p17 

<400> 26 

tagaccctgg ccgctgctac c 

<210> 27 
<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p19 

<400> 27 

cgcatcgcaa tcccgaatgc gg 

<210> 28 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Artificially 
synthesized primer sequence, p26 

<400> 28 

caaaaggatg cgccgaagct gtctggag 

<210> 29 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p29 

<400> 29 

gttggaagaa agtaaattgg gaa 

<210> 30 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p36 

<400> 30 

gatactctca ttaggaggtc c 

<210> 31 
<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p48 

<400> 31 

cattaattga agtgagattt ttctgg 

<210> 32 
<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p59 

<400> 32 

agcatttgtg gaaatactct gg 

<210> 33 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p85 

<400> 33 

aactttattt ctcctccaca tcag 

<210> 34 
<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificially 
synthesized primer sequence, p86 

<400> 34 

gtgaatggct cagagacaag g 21 



<210> 35 
<211> 11204 
<212> DNA 
<213> Homo sapiens 

<400> 35 

aggttcagag agactgtggg aatatggggg aattagaggc tatctgaggc tcttcaacac 60 
aataacccaa gaagctattt aaatgctctt taaggtattt acataaatat tactattctc 120 
attgtgcttt tattttgtgt tatcatgatt ataattgaag tgtctactgt tactgcctcc 180 
tgatctttgc tagctatgga gcatggactg ggcttttaga gcagcagccc caaaggaacc 240 
taaacattaa agcagagctg ccctcaatgg tttaacctgt gtgactctgc ctatgacagc 300 
cccacccacc catcttcact ggatccaaat caggagcaag gccgttgggg tacctggtgg 360 
gggtgatgct gtcaggggag gagcccaaaa gggcaagctc aaatttgaat gtgaagggcc 420 
aatgcactgt cagactgaga cagagaacca tcattaattg aagtgagatt tttctggcct 480 
gagacttgca gggaggcaag aagacactct ggacaccact atggacaggt aaagaggcag 540 
tcttctcgtg ggtgattgca ctggccttcc tctcagagca aatctgagta atgagactgg 600 
tagctatccc tttctctcat gtaactgtct gactgataag atcagcttga tcaatatgca 660 
tatatatttt ttgatctgtc tccttttctt ctattcagat cttatacgct gtcagcccaa 720 
ttctttctgt ttcagacttc tcttgatttc cctctttttc atgtggcaaa agaagtagtg 780 
cgtacaatgt actgattcgt cctgagattt gtaccatggt tgaaactaat ttatggtaat 840 
aatattaaca tagcaaatct ttagagactc aaatcatgaa aaggtaatag cagtactgta 900 
ctaaaaacgg tagtgctaat tttcgtaata attttgtaaa tattcaacag taaaacaact 960 
tgaagacaca ctttcctagg gaggcgttac tgaaataatt tagctatagt aagaaaattt 1020 
gtaattttag aaatgccaag cattctaaat taattgcttg aaagtcacta tgattgtgtc 1080 
cattataagg agacaaattc attcaagcaa gttatttaat gttaaaggcc caattgttag 1140 
gcagttaatg gcacttttac tattaactaa tctttccatt tgttcagacg tagcttaact 1200 
tacctcttag gtgtgaattt ggttaaggtc ctcataatgt ctttatgtgc agtttttgat 1260 
aggttattgt catagaactt attctattcc tacatttatg attactatgg atgtatgaga 1320 
ataacaccta atccttatac tttacctcaa tttaactcct ttataaagaa cttacattac 1380 
agaataaaga ttttttaaaa atatattttt ttgtagagac agggtcttag cccagccgag 1440 
gctggtctct aagtcctggc ccaagcgatc ctcctgcctg ggcctcctaa agtgctggaa 1500 
ttatagacat gagccatcac atccaatata cagaataaag atttttaatg gaggatttaa 1560 
tgttcttcag aaaattttct tgaggtcaga caatgtcaaa tgtctcctca gtttacactg 1620 
agattttgaa aacaagtctg agctataggt ccttgtgaag ggtccattgg aaatacttgt 1680 
tcaaagtaaa atggaaagca aaggtaaaat cagcagttga aattcagaga aagacagaaa 1740 
aggagaaaag atgaaattca acaggacaga agggaaatat attatcatta aggaggacag 1800 
tatctgtaga gctcattagt gatggcaaaa tgacttggtc aggattattt ttaacccgct 1860 
tgtttctggt ttgcacggct ggggatgcag ctagggttct gcctcaggga gcacagctgt 1920 
ccagagcagc tgtcagcctg caagcctgaa acactccctc ggtaaagtcc ttcctactca 1980 
ggacagaaat gacgagaaca gggagctgga aacaggcccc taaccagaga agggaagtaa 2040 
tggatcaaca aagttaacta gcaggtcagg atcacgcaat tcatttcact ctgactggta 2100 
acatgtgaca gaaacagtgt aggcttattg tattttcatg tagagtagga cccaaaaatc 2160 
cacccaaagt cctttatcta tgccacatcc ttcttatcta tacttccagg acactttttc 2220 
ttccttatga taaggctctc tctctctcca cacacacaca cacacacaca cacacacaca 2280 
cacacacaca cacaaacaca caccccgcca accaaggtgc atgtaaaaag atgtagattc 2340 
ctctgccttt ctcatctaca cagcccagga gggtaagtta atataagagg gatttattgg 2400 
taagagatga tgcttaatct gtttaacact gggcctcaaa gagagaattt cttttcttct 2460 
gtacttatta agcacctatt atgtgttgag cttatatata caaagggtta ttatatgcta 2520 
atatagtaat agtaatggtg gttggtacta tggtaattac cataaaaatt attatccttt 2580 
taaaataaag ctaattatta ttggatcttt tttagtattc attttatgtt ttttatgttt 2640 
ttgatttttt aaaagacaat ctcaccctgt tacccaggct ggagtgcagt ggtgcaatca 2700 
tagctttctg cagtcttgaa ctcctgggct caagcaatcc tcctgccttg gcctcccaaa 2760 
gtgttgggat acagtcatga gccactgcat ctggcctagg atccatttag attaaaatat 2820 
gcattttaaa ttttaaaata atatggctaa tttttacctt atgtaatgtg tatactggta 2880 
ataaatctag tttgctgcct aaagtttaaa gtgctttcca ataagcttca tgtacgtgag 2940 
gggagacatt taaagtgaaa cagacagcca ggtgtggtgg ctcacgcctg taatcccagc 3000 
actctgggag gctgaggtgg gtggatcgct tgagccctgg agttcaagac cagcctgagc 3060 
aacatggcaa aaccctgttt ctataacaaa aattagccgg gcatggtggc atgtgcctgt 3120 
ggtcccagct actagggggc tgaggcagga gaatctttgg agcccaggag gtcaaggctg 3180 
cactgagcag tgcttgcgcc actgcactcc agcctgggtg acaggaccag accttgcctc 3240 
aaaaaaataa gaagaaaaat taaaaataaa tggaaacaac tacaaagagc tgttgtccta 3300 
gatgagctac ttagttaggc tgatattttg gtatttaact tttaaagtca gggtctgtca 3360 
cctgcactac attattaaaa tatcaattct caatgtatat ccacacaaag actggtacgt 3420 
gaatgttcat agtaccttta ttcacaaaac cccaaagtag agactatcca aatatccatc 3480 
aacaagtgaa caaataaaca aaatgtgcta tatccatgca atggaatacc accctgcagt 3540 
acaaaggaag aagctacttg gggatgaatc ccaaagtcat gacgctaaat gaaagagtca 3600 
gacatgaagg aggagataat gtatgccata cgaaattcta gaaaatgaaa gtaacttata 3660 
gttacagaaa gcaaatcagg gcaggcatag aggctcacac ctgtaatccc agcactttga 3720 
gaggccacgt gggaagattg ctagaactca ggagttcaag accagcctgg gcaacacagt 3780 
gaaactccat tctccacaaa aatgggaaaa aaagaaagca aatcagtggt tgtcctgtgg 3840 
ggaggggaag gactgcaaag agggaagaag ctctggtggg gtgagggtgg tgattcaggt 3900 
tctgtatcct gactgtggta gcagtttggg gtgtttacat ccaaaaatat tcgtagaatt 3960 
atgcatctta aatgggtgga gtttactgta tgtaaattat acctcaatgt aagaaaaaat 4020 
aatgtgtaag aaaagtttca attctcttgc cagcaaacgt tattcaaatt cctgagccct 4080 
ttacttcgca aattctctgc acttctgccc cgtaccatta ggtgacagca ctagctccac 4140 
aaattggata aatgcatttc tggaaaagac tagggacaaa atccaggcat cacttgtgct 4200 
ttcatatcaa ccacgctgta cagcttgtgt tgctgtctgc agctgcaatg gggactcttg 4260 
atttctttaa ggaaacttgg gttaccagag tatttccaca aatgctattc aaattagtgc 4320 
ttatgatatg caagacactg tgctaggagc cagaaaacaa agaggaggag aaatcagtca 4380 
ttatgtggga acaacatagc aagatattta gatcattttg actagttaaa aaagcagcag 4440 
agtacaaaat cacacatgca atcagtataa tccaaatcat gtaaatatgt gcctgtagaa 4500 
agactagagg aataaacaca agaatcttaa cagtcattgt cattagacac taagtctaat 4560 
tattattatt agacactatg atatttgaga tttaaaaaat ctttaatatt ttaaaattta 4620 
gagctcttct atttttccat agtattcaag tttgacaatg atcaagtatt actctttctt 4680 
tttttttttt tttttttttt tttgagatgg agttttggtc ttgttgccca tgctggagtg 4740 
gaatggcatg accatagctc actgcaacct ccacctcctg ggttcaagca aagctgtcgc 4800 
ctcagcctcc cgggtagatg ggattacagg cgcccaccac cacactcggc taatgtttgt 4860 
atttttagta gagatggggt ttcaccatgt tggccaggct ggtctcaaac tcctgacctc 4920 
agaggatcca cctgcctcag cctcccaaag tgctgggatt acagatgtag gccactgcgc 4980 
ccggccaagt attgctctta tacattaaaa aacaggtgtg agccactgcg cccagccagg 5040 
tattgctctt atacattaaa aaataggccg gtgcagtggc tcacgcctgt aatcccagca 5100 
ctttgggaag ccaaggcggg cagaacaccc gaggtcagga gtccaaggcc agcctggcca 5160 
agatggtgaa accccgtctc tattaaaaat acaaacatta cctgggcatg atggtgggcg 5220 
cctgtaatcc cagctactca ggaggctgag gcaggaggat ccgcggagcc tggcagatct 5280 
gcctgagcct gggaggttga ggctacagta agccaagatc atgccagtat acttcagcct 5340 
gggcgacaaa gtgagaccgt aacaaaaaaa aaaaaattta aaaaaagaaa tttagatcaa 5400 
gatccaactg taaaaagtgg cctaaacacc acattaaaga gtttggagtt tattctgcag 5460 
gcagaagaga accatcaggg ggtcttcagc atgggaatgg catggtgcac ctggtttttg 5520 
tgagatcatg gtggtgacag tgtggggaat gttattttgg agggactgga ggcagacaga 5580 
ccggttaaaa ggccagcaca acagataagg aggaagaaga tgagggcttg gaccgaagca 5640 
gagaagagca aacagggaag gtacaaattc aagaaatatt ggggggtttg aatcaacaca 5700 
tttagatgat taattaaata tgaggactga ggaataagaa atgagtcaag gatggttcca 5760 
ggctgctagg ctgcttacct gaggtggcaa agtcgggagg agtggcagtt taggacaggg 5820 
ggcagttgag gaatattgtt ttgatcattt tgagtttgag gtacaagttg gacacttagg 5880 
taaagactgg aggggaaatc tgaatataca attatgggac tgaggaacaa gtttatttta 5940 
ttttttgttt cgttttcttg ttgaagaaca aatttaattg taatcccaag tcatcagcat 6000 
ctagaagaca gtggcaggag gtgactgtct tgtgggtaag ggtttggggt ccttgatgag 6060 
tatctctcaa ttggccttaa atataagcag gaaaaggagt ttatgatgga ttccaggctc 6120 
agcagggctc aggagggctc aggcagccag cagaggaagt cagagcatct tctttggttt 6180 
agcccaagta atgacttcct taaaaagctg aaggaaaatc cagagtgacc agattataaa 6240 
ctgtactctt gcattttctc tccctcctct cacccacagc ctcttgatga accggaggaa 6300 
gtttctttac caattcaaaa atgtccgctg ggctaagggt cggcgtgaga cctacctgtg 6360 
ctacgtagtg aagaggcgtg acagtgctac atccttttca ctggactttg gttatcttcg 6420 
caataaggta tcaattaaag tcagctttgc aagcagttta atggtcaact gtgagtgctt 6480 
ttagagccac ctgctgatgg tattacttcc atcctttttt ggcatttgtg tctctatcac 6540 
attcctcaaa tccttttttt tatttctttt tccatgtcca tgcacccata ttagacatgg 6600 
cccaaaatat gtgatttaat tcctccccag taatgctggg caccctaata ccactccttc 6660 
cttcagtgcc aagaacaact gctcccaaac tgtttaccag ctttcctcag catctgaatt 6720 
gcctttgaga ttaattaagc taaaagcatt tttatatggg agaatattat cagcttgtcc 6780 
aagcaaaaat tttaaatgtg aaaaacaaat tgtgtcttaa gcatttttga aaattaagga 6840 
agaagaattt gggaaaaaat taacggtggt tcaattctgt tttccaaatg atttcttttc 6900 
cctcctactc acatgggtcg taggccagtg aatacattca acatggtgat ccccagaaaa 6960 
ctcagagaag cctcggctga tgattaatta aattgatctt tcggctaccc gagagaatta 7020 
catttccaag agacttcttc accaaaatcc agatgggttt acataaactt ctgcccatgg 7080 
gtatctcctc tctcctaaca cgctgtgacg tctgggcttg gtggaatctc agggaagcat 7140 
ccgtggggtg gaaggtcatc gtctggctcg ttgtttgatg gttatattac catgcaattt 7200 
tctttgccta catttgtatt gaatacatcc caatctcctt cctattcggt gacatgacac 7260 
attctatttc agaaggcttt gattttatca agcactttca tttacttctc atggcagtgc 7320 
ctattacttc tcttacaata cccatctgtc tgctttacca aaatctattt ccccttttca 7380 
gatcctccca aatggtcctc ataaactgtc ctgcctccac ctagtggtcc aggtatattt 7440 
ccacaatgtt acatcaacag gcacttctag ccattttcct tctcaaaagg tgcaaaaagc 7500 
aacttcataa acacaaatta aatcttcggt gaggtagtgt gatgctgctt cctcccaact 7560 
cagcgcactt cgtcttcctc attccacaaa aacccatagc cttccttcac tctgcaggac 7620 
tagtgctgcc aagggttcag ctctacctac tggtgtgctc ttttgagcaa gttgcttagc 7680 
ctctctgtaa cacaaggaca atagctgcaa gcatccccaa agatcattgc aggagacaat 7740 
gactaaggct accagagccg caataaaagt cagtgaattt tagcgtggtc ctctctgtct 7800 
ctccagaacg gctgccacgt ggaattgctc ttcctccgct acatctcgga ctgggaccta 7860 
gaccctggcc gctgctaccg cgtcacctgg ttcacctcct ggagcccctg ctacgactgt 7920 
gcccgacatg tggccgactt tctgcgaggg aaccccaacc tcagtctgag gatcttcacc 7980 
gcgcgcctct acttctgtga ggaccgcaag gctgagcccg aggggctgcg gcggctgcac 8040 
cgcgccgggg tgcaaatagc catcatgacc ttcaaaggtg cgaaagggcc ttccgcgcag 8100 
gcgcagtgca gcagcccgca ttcgggattg cgatgcggaa tgaatgagtt agtggggaag 8160 
ctcgagggga agaagtgggc ggggattctg gttcacctct ggagccgaaa ttaaagatta 8220 
gaagcagaga aaagagtgaa tggctcagag acaaggcccc gaggaaatga gaaaatgggg 8280 
ccagggttgc ttctttcccc tcgatttgga acctgaactg tcttctaccc ccatatcccc 8340 
gccttttttt cctttttttt ttttttgaag attattttta ctgctggaat acttttgtag 8400 
aaaaccacga aagaactttc aaagcctggg aagggctgca tgaaaattca gttcgtctct 8460 
ccagacagct tcggcgcatc cttttggtaa ggggcttcct cgctttttaa attttctttc 8520 
tttctctaca gtcttttttg gagtttcgta tatttcttat attttcttat tgttcaatca 8580 
ctctcagttt tcatctgatg aaaactttat ttctcctcca catcagcttt ttcttctgct 8640 
gtttcaccat tcagagccct ctgctaaggt tccttttccc tcccttttct ttcttttgtt 8700 
gtttcacatc tttaaatttc tgtctctccc cagggttgcg tttccttcct ggtcagaatt 8760 
cttttctcct tttttttttt tttttttttt ttttttaaac aaacaaacaa aaaacccaaa 8820 
aaaactcttt cccaatttac tttcttccaa catgttacaa agccatccac tcagtttaga 8880 
agactctccg gccccaccga cccccaacct cgttttgaag ccattcactc aatttgcttc 8940 
tctctttctc tacagcccct gtatgaggtt gatgacttac gagacgcatt tcgtactttg 9000 
ggactttgat agcaacttcc aggaatgtca cacacgatga aatatctctg ctgaagacag 9060 
tggataaaaa acagtccttc aagtcttctc tgtttttatt cttcaactct cactttctta 9120 
gagtttacag aaaaaatatt tatatacgac tctttaaaaa gatctatgtc ttgaaaatag 9180 
agaaggaaca caggtctggc cagggacgtg ctgcaattgg tgcagttttg aatgcaacat 9240 
tgtcccctac tgggaataac agaactgcag gacctgggag catcctaaag tgtcaacgtt 9300 
tttctatgac ttttaggtag gatgagagca gaaggtagat cctaaaaagc atggtgagag 9360 
gatcaaatgt ttttatatca acatccttta ttatttgatt catttgagtt aacagtggtg 9420 
ttagtgatag atttttctat tcttttccct tgacgtttac tttcaagtaa cacaaactct 9480 
tccatcaggc catgatctat aggacctcct aatgagagta tctgggtgat tgtgacccca 9540
aaccatctct ccaaagcatt aatatccaat catgcgctgt atgttttaat cagcagaagc 9600 
atgtttttat gtttgtacaa aagaagattg ttatgggtgg ggatggaggt atagaccatg 9660 
catggtcacc ttcaagctac tttaataaag gatcttaaaa tgggcaggag gactgtgaac 9720 
aagacaccct aataatgggt tgatgtctga agtagcaaat cttctggaaa cgcaaactct 9780 
tttaaggaag tccctaattt agaaacaccc acaaacttca catatcataa ttagcaaaca 9840 
attggaagga agttgcttga atgttgggga gaggaaaatc tattggctct cgtgggtctc 9900 
ttcatctcag aaatgccaat caggtcaagg tttgctacat tttgtatgtg tgtgatgctt 9960 
ctcccaaagg tatattaact atataagaga gttgtgacaa aacagaatga taaagctgcg 10020 
aaccgtggca cacgctcata gttctagctg cttgggaggt tgaggaggga ggatggcttg 10080 
aacacaggtg ttcaaggcca gcctgggcaa cataacaaga tcctgtctct caaaaaaaaa 10140 
aaaflflPflf>ft^ agaaagagag agggccgggc gtggtggctc acgcctgtaa tcccagcact 10200 
ttgggaggcc gagccgggcg gatcacctgt ggtcaggagt ttgagaccag cctggccaac 10260 
atggcaaaac cccgtctgta ctcaaaatgc aaaaattagc caggcgtggt agcaggcacc 10320 
tgtaatccca gctacttggg aggctgaggc aggagaatcg cttgaaccca ggaggtggag 10380 
gttgcagtaa gctgagatcg tgccgttgca ctccagcctg ggcgacaaga gcaagactct 10440
gtctcagaaa aaaaaaaaaa aaagagagag agagagaaag agaacaatat ttgggagaga 10500 
agcatgggga agcattgcaa ggaaattgtg ctttatccaa caaaatgtaa ggagccaata 10560
agggatccct atttgtctct tttggtgtct atttgtccct aacaactgtc tttgacagtg 10620 
agaaaaatat tcagaataac catatccctg tgccgttatt acctagcaac ccttgcaatg 10680 
aagatgagca gatccacagg aaaacttgaa tgcacaactg tcttatttta atcttattgt 10740 
acataagttt gtaaaagagt taaaaattgt tacttcatgt attcatttat attttatatt 10800 
attttgcgtc taatgatttt ttattaacat gatttccttt tctgatatat tgaaatggag 10860 
tctcaaagct tcataaattt ataactttag aaatgattct aataacaacg tatgtaattg 10920 
taacattgca gtaatggtgc tacgaagcca tttctcttga tttttagtaa acttttatga 10980 
cagcaaattt gcttctggct cactttcaat cagttaaata aatgataaat aattttggaa 11040 
gctgtgaaga taaaatacca aataaaataa tataaaagtg atttatatga agttaaaata 11100 
aaaaatcagt atgatggaat aaacttgaga gtccagaagt tatcccatac atctgtaatc 11160 
aactaatttc tcacaagggt gtaaggacca ttcaatggag aaaa 11204 



Claims

1. A DNA or a fragment thereof encoding a protein comprising the amino acid sequence of SEQ ID NO: 2 or 8.

2. The DNA or the fragment of claim 1, wherein the protein has a cytidine deaminase activity.

3. A DNA or a fragment thereof comprising the nucleotide sequence of SEQ ID NO: 1 or 7. 

4. A DNA or a fragment thereof comprising a nucleotide sequence of (a) or (b) below: 

(a) a nucleotide sequence comprising the nucleotide residues 93 to 689 of SEQ ID NO: 1 or

(b) a nucleotide sequence comprising the nucleotide residues 80 to 676 of SEQ ID NO: 7. 

5. A DNA or a fragment of (a) or (b) below: 

(a) a DNA or a fragment thereof that hybridizes under stringent conditions with a DNA comprising the nucleotide
sequence of SEQ ID NO: 1 and that encodes a mammal-derived protein being homologous to a protein that
comprises the amino acid sequence of SEQ ID NO: 2 and having a cytidine deaminase activity or

(b) a DNA or a fragment thereof that hybridizes under stringent conditions with a DNA comprising the nucleotide
sequence of SEQ ID NO: 7 and that encodes a mammal-derived protein being homologous to a protein that
comprises the amino acid sequence of SEQ ID NO: 8 and having a cytidine deaminase activity.

6. A protein or a fragment thereof comprising the amino acid sequence of SEQ ID NO: 2 or 8.

7. A protein or a fragment thereof comprising substantially the same amino acid sequence as that of SEQ ID NO: 2
or 8 and having a cytidine deaminase activity.

8. A protein of (a) or (b) below: 

(a) a mammal-derived protein that comprises an amino acid sequence encoded by a DNA hybridizing under
stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID NO: 1, that is homologous to
a protein comprising the amino acid sequence of SEQ ID NO: 2, and that has a cytidine deaminase activity or

(b) a mammal-derived protein that comprises an amino acid sequence encoded by a DNA hybridizing under
stringent conditions with a DNA comprising the nucleotide sequence of SEQ ID NO: 7, that is homologous to
a protein comprising the amino acid sequence of SEQ ID NO: 8, and that has a cytidine deaminase activity.

9. An expression vector comprising the DNA or the fragment of any one of claims 1 to 5.

10. A transformant transformed with the expression vector of claim 9.

11. An antibody or a portion thereof reactive to the protein of any one of claims 6 to 8 or to a fragment of the protein.

12. The antibody or the portion of claim 11, wherein the antibody is a monoclonal antibody.

13. A pharmaceutical composition comprising the antibody or the portion of claim 11 or 12, and a pharmaceutically
acceptable carrier.

14. A cell producing a monoclonal antibody reactive to the protein of any one of claims 6 to 8 or to a fragment of the
protein.

15. The cell of claim 14, wherein the cell is a hybridoma obtained by fusing, with a mammal-derived myeloma cell, a
non-human mammal-derived B cell that produces a monoclonal antibody.

16. The cell of claim 15, wherein the cell is a transgenic cell transformed by introducing, into a cell, either or both of
a DNA encoding a heavy chain of the monoclonal antibody and a DNA encoding a light chain of the monoclonal 
antibody. 

17. A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one of (a) to (c) below: 

(a) SEQ ID NO: 9,

(b) SEQ ID NO: 10, or

(c) SEQ ID NO: 35.

18. A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one of (a) to (e) below:

(a) SEQ ID NO: 11,

(b) SEQ ID NO: 12,

(c) SEQ ID NO: 13,

(d) SEQ ID NO: 14, or

(e) SEQ ID NO: 15.

19. A DNA comprising a complementary nucleotide sequence to an arbitrary partial nucleotide sequence of a
nucleotide sequence of any one of (a) to (h) below:

(a) SEQ ID NO: 9,

(b) SEQ ID NO: 10,

(c) SEQ ID NO: 11,

(d) SEQ ID NO: 12,

(e) SEQ ID NO: 13,

(f) SEQ ID NO: 14,

(g) SEQ ID NO: 15, or

(h) SEQ ID NO: 35.

20. The DNA of claim 19, wherein the DNA comprises a nucleotide sequence of any one of (a) to (q) below:

(a) SEQ ID NO: 18,

(b) SEQ ID NO: 19,

(c) SEQ ID NO: 20,

(d) SEQ ID NO: 21,

(e) SEQ ID NO: 22,

(f) SEQ ID NO: 23,

(g) SEQ ID NO: 24,

(h) SEQ ID NO: 25,

(i) SEQ ID NO: 26,

(j) SEQ ID NO: 27,

(k) SEQ ID NO: 28,

(l) SEQ ID NO: 29,

(m) SEQ ID NO: 30,

(n) SEQ ID NO: 31,

(o) SEQ ID NO: 32,

(p) SEQ ID NO: 33, or

(q) SEQ ID NO: 34.

21. Use of the DNA of claim 19 or 20 as a primer DNA in polymerase chain reaction.

22. Use of a pair of DNA of any one of (a) to (n) below as primer DNAs in polymerase chain reaction: 

(a) a DNA comprising the nucleotide sequence of SEQ ID NO: 31 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 32,

(b) a DNA comprising the nucleotide sequence of SEQ ID NO: 20 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 22,

(c) a DNA comprising the nucleotide sequence of SEQ ID NO: 21 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 30,

(d) a DNA comprising the nucleotide sequence of SEQ ID NO: 24 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 25,

(e) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 27,

(f) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 28,

(g) a DNA comprising the nucleotide sequence of SEQ ID NO: 23 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29,

(h) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 27,

(i) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 28,

(j) a DNA comprising the nucleotide sequence of SEQ ID NO: 26 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29,

(k) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 28,

(l) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29,

(m) a DNA comprising the nucleotide sequence of SEQ ID NO: 33 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 29, or

(n) a DNA comprising the nucleotide sequence of SEQ ID NO: 18 and a DNA comprising the nucleotide
sequence of SEQ ID NO: 19.

23. A method for identifying a substance that regulates transcription of a gene encoding an AID protein comprising
the amino acid sequence of SEQ ID NO: 2 or 8 into mRNA, or production of the AID protein, the method comprising
the steps of:

(a) culturing, separately in the presence and the absence of the substance, cells producing the AID protein and

(b) (i) comparing the level of the AID protein produced by the cells cultured in the presence of the substance
with the level of the AID protein produced by the cells cultured in the absence of the substance or

(ii) comparing the level of the AID protein-encoding mRNA transcribed in the cells cultured in the presence
of the substance with the level of the AID protein-encoding mRNA transcribed in the cells cultured in the
absence of the substance.

24. A method for identifying a substance that regulates transcription of a gene encoding an AID protein comprising 
the amino acid sequence of SEQ ID NO: 2 or 8 into mRNA, or production of the AID protein, the method comprising 
the steps of: 

(a) culturing, separately in the presence and the absence of the substance, cells producing the AID protein
and a protein other than the AID protein, wherein transcription of a gene encoding the other protein into mRNA
is dependent in the cells on the degree of a signal of transcription of the gene encoding the AID protein into
mRNA and

(b) comparing the level of the other protein produced by the cells cultured in the presence of the substance
with the level of the other protein produced by the cells cultured in the absence of the substance.

25. The method of claim 23 or 24, wherein the cells are transgenic cells transformed with a gene encoding the protein. 

26. The method of claim 24, wherein the cells are transgenic cells transformed with a gene encoding the protein and
a gene encoding the other protein. 

27. The method of claim 26, wherein the protein is a reporter protein. 

28. The method of claim 27, wherein comparison of the level of the other protein is comparison of the level of a signal
generated by the reporter protein. 

29. The method of claim 27 or 28, wherein the reporter protein is luciferase.

30. A method for identifying a substance that inhibits an enzyme activity of an AID protein comprising the amino acid
sequence of SEQ ID NO: 2 or 8, the method comprising the step of (a) or (b) below:

(a) culturing, separately in the presence and the absence of the substance, mammal-derived B cells or tissues
comprising the B cells, and comparing enzyme activities of the AID proteins in the B cells separately cultured or

(b) (i) administering the substance separately to an AID gene knockout mouse whose endogenous AID gene
is inactivated so that transcription of the endogenous AID gene into mRNA is inhibited, and to a normal mouse
and

(ii) comparing enzyme activities of the AID proteins in the B cells isolated from the respective mice.

31. The method of claim 30, wherein the enzyme activity is a cytidine deaminase activity.

Figure 2

Figure 3

Figure 5

Figure 6

Figure 7

Figure 8

Figure 25